An open API service providing repository metadata for many open source software ecosystems.

Topic: "data-format"

toon-format/toon

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

Language: TypeScript - Size: 1.65 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 20,377 - Forks: 896

lance-format/lance

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming..

Language: Rust - Size: 38.1 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 5,842 - Forks: 501

ron-rs/ron

Rusty Object Notation

Language: Rust - Size: 7.77 MB - Last synced at: about 3 hours ago - Pushed at: 6 days ago - Stars: 3,790 - Forks: 143

apache/carbondata

High performance data store solution

Language: Scala - Size: 82.8 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1,439 - Forks: 705

bevry/cson

CoffeeScript-Object-Notation. Same as JSON but for CoffeeScript objects.

Language: CoffeeScript - Size: 698 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1,344 - Forks: 55

securisec/chepy

Chepy is a python lib/cli equivalent of the awesome CyberChef tool.

Language: Python - Size: 4.98 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 1,016 - Forks: 60

kinverarity1/lasio

Python library for reading and writing well data using Log ASCII Standard (LAS) files

Language: Lasso - Size: 5.03 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 375 - Forks: 160

couchbase/fleece

A super-fast, compact, JSON-equivalent binary data format

Language: C++ - Size: 4.56 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 324 - Forks: 34

ScrapeGraphAI/toonify

Toonify: Compact data format reducing LLM token usage by 30-60%

Language: Python - Size: 1.66 MB - Last synced at: 27 days ago - Pushed at: 30 days ago - Stars: 248 - Forks: 16

NeurodataWithoutBorders/pynwb

A Python API for working with Neurodata stored in the NWB Format

Language: Python - Size: 45.5 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 205 - Forks: 90

cheminfo/netcdfjs

Read and explore NetCDF files

Language: TypeScript - Size: 5.31 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 161 - Forks: 31

svenvc/ston

STON - Smalltalk Object Notation - A lightweight text-based, human-readable data interchange format for class-based object-oriented languages like Smalltalk.

Language: Smalltalk - Size: 565 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 140 - Forks: 32

jorinvo/edn-data

EDN parser and generator that works with plain JS data, with support for TS and node streams

Language: TypeScript - Size: 536 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 6

HelgeSverre/toon-php

Token-Oriented Object Notation - A compact data format for reducing token consumption when sending structured data to LLMs (PHP implementation)

Language: PHP - Size: 526 KB - Last synced at: 17 days ago - Pushed at: 20 days ago - Stars: 97 - Forks: 7

DahnJ/Awesome-Zarr

🎀 Awesome Zarr resources

Size: 201 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 93 - Forks: 1

NaturalIntelligence/nimn-spec

Just Data. Save up to 85% network bandwidth and storage.

Size: 64.5 KB - Last synced at: 8 months ago - Pushed at: about 2 years ago - Stars: 89 - Forks: 5

gmggroup/omf-python

Python library for working with OMF files

Language: Jupyter Notebook - Size: 21.3 MB - Last synced at: almost 2 years ago - Pushed at: almost 3 years ago - Stars: 77 - Forks: 19

bevry/envfile

Parse and write environment files with Node.js

Language: TypeScript - Size: 955 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 62 - Forks: 10

MiraGeoscience/geoh5py

Python API for geoh5, an open file format for geoscientific data.

Language: Python - Size: 35.2 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 60 - Forks: 12

NeurodataWithoutBorders/nwb-schema

Data format specification schema for the NWB neurophysiology data format

Size: 67.2 MB - Last synced at: 15 days ago - Pushed at: 16 days ago - Stars: 60 - Forks: 16

PEtab-dev/PEtab

PEtab - an SBML and TSV based data format for parameter estimation problems in systems biology

Size: 11.4 MB - Last synced at: 21 days ago - Pushed at: 23 days ago - Stars: 60 - Forks: 12

eternal-io/keon

A human readable object notation / serialization format that syntactic similar to Rust and completely supports Serde's data model.

Language: Rust - Size: 221 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 59 - Forks: 1

reflektone-games/SimaiSharp

A serializer/deserializer for the rhythm game chart format simai.

Language: C# - Size: 264 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 46 - Forks: 8

ViliOrg/Vili

A nice and readable data format !

Language: C++ - Size: 834 KB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 3

open-gamma-ray-astro/gamma-astro-data-formats

Data formats for gamma-ray astronomy

Language: Python - Size: 9.23 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 27 - Forks: 27

BAMWelDX/weldx

The welding data exchange format

Language: Python - Size: 7.7 MB - Last synced at: 8 days ago - Pushed at: 11 days ago - Stars: 23 - Forks: 9

vim89/toon4s

toon4s: Token-Oriented Object Notation for JVM

Language: Scala - Size: 572 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 23 - Forks: 3

hltcoe/concrete-python

Python modules and scripts for working with Concrete, a data serialization format for NLP

Language: Python - Size: 1.96 MB - Last synced at: 11 days ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 8

JohnnyBravo75/DataBridge.NET

Configurable data bridge for permanent ETL jobs

Language: C# - Size: 11.1 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 20 - Forks: 10

ready-steady/hdf5

Reader and writer of HDF5 files

Language: Go - Size: 28.3 KB - Last synced at: 4 months ago - Pushed at: over 8 years ago - Stars: 18 - Forks: 2

theobori/nix-converter

All-in-one converter configuration language to Nix and vice versa

Language: Go - Size: 88.9 KB - Last synced at: 6 days ago - Pushed at: 10 days ago - Stars: 17 - Forks: 2

hltcoe/concrete

Thrift definitions, making HLT data specifications concrete

Language: Thrift - Size: 4.61 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 16 - Forks: 5

LorisYounger/LinePutScript

LinePutScript是一种数据交换格式定义行读取结构和描述其内容的标准语言

Language: C# - Size: 24 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 13 - Forks: 2

insightindustry/spss-converter

A simple utility that converts SPSS data to / from Pandas DataFrames, CSV, Excel, JSON, YAML, and dict.

Language: Python - Size: 69.3 KB - Last synced at: 29 days ago - Pushed at: almost 3 years ago - Stars: 13 - Forks: 4

iddev5/inon

:floppy_disk: Data serialization format in Zig

Language: Zig - Size: 169 KB - Last synced at: 8 months ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

fundsxml/schema

FundsXML XSD Files

Size: 2.12 MB - Last synced at: 18 days ago - Pushed at: 21 days ago - Stars: 9 - Forks: 2

tkarabela/ensight-reader

A pure Python reader for the EnSight Gold format

Language: Python - Size: 438 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 9 - Forks: 4

dnaka91/mabo

Data format and schema, with a type system as strong as Rust's.

Language: Rust - Size: 2.3 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 9 - Forks: 1

burtonageo/bvh_anim

Loader for bvh animation files

Language: Rust - Size: 534 KB - Last synced at: 29 days ago - Pushed at: 6 months ago - Stars: 9 - Forks: 7

monsterkodi/noon

no ordinary object notation

Language: JavaScript - Size: 1.72 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 9 - Forks: 1

vimaec/vim

VIM - Runtime 3D BIM Data Format for AEC

Language: C# - Size: 153 KB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 9 - Forks: 3

erikw/nestedtext-ruby

A ruby implementation of NestedText https://nestedtext.org/

Language: Ruby - Size: 1.96 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 8 - Forks: 1

ason-format/ason

ASON (Aliased Serialization Object Notation) is a serialization format designed to optimize token consumption in LLM (Large Language Model) contexts while maintaining human readability and guaranteeing complete round-trip fidelity.

Language: JavaScript - Size: 1.48 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 0

reflektone-games/simai.js

A serializer/deserializer for the rhythm game chart format simai.

Language: TypeScript - Size: 206 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 2

RNABioInfo/rna-interaction-format

RNA Interaction Format (RIF) with reference implementations in C++, JavaScript, and Python

Language: C++ - Size: 298 KB - Last synced at: 23 days ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 0

nexusformat/exampledata

Examples of (mostly) real world NeXus files to inspect, test and train reading software with.

Language: Python - Size: 45.8 MB - Last synced at: almost 2 years ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 7

cs91chris/flask_response_builder

Implementations of flask response in many formats like: json, xml, html...

Language: Python - Size: 83 KB - Last synced at: 2 months ago - Pushed at: about 3 years ago - Stars: 6 - Forks: 0

OGRECave/DotSceneFormat 📦

Now in the main repository

Language: C++ - Size: 67.4 KB - Last synced at: over 2 years ago - Pushed at: over 6 years ago - Stars: 6 - Forks: 2

HeavyIonAnalysis/AnalysisTree

AnalysisTree data format

Language: C++ - Size: 29.8 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 5 - Forks: 8

metayeti/Lime

Game data packer

Language: C++ - Size: 28.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

JoiLa/cdt

convert between different data types easily and conveniently, with golang. support json and sql serialize

Language: Go - Size: 156 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

mamantoha/toon-crystal

Crystal implementation of the Token-Oriented Object Notation(TOON) serialization format

Language: Crystal - Size: 93.8 KB - Last synced at: 3 days ago - Pushed at: 5 days ago - Stars: 4 - Forks: 0

jimmystridh/toon-rs

🎨 Rust implementation of TOON (Token-Oriented Object Notation) - A human-readable, token-efficient serialization format for LLMs with full serde integration

Language: Rust - Size: 332 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2

lifs-tools/rmzTabM

The R-language bindings for mzTab-M

Language: R - Size: 1.86 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 3

hltcoe/concrete-js

JavaScript library for working with Concrete, a data serialization format for NLP

Language: JavaScript - Size: 4.26 MB - Last synced at: 4 months ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

master-co/business

A business data model for quick verification, access and output of specific data formats.

Language: TypeScript - Size: 332 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 4 - Forks: 0

DFASDL/dfasdl-core 📦

Moved: https://codeberg.org/tensei-data/dfasdl-core

Language: Scala - Size: 93.8 KB - Last synced at: almost 3 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 2

Hihaheho/eure

A minimalist, schema- and editor-friendly data language for algebraic data types and concise descriptions of deeply nested data.

Language: Rust - Size: 2.92 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3 - Forks: 0

crawlcore/scp-protocol

Site Content Protocol (SCP) reduces waste of bandwidth & processing power during site crawling.

Language: Python - Size: 49.8 KB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 0

AndreaIannoli/TOONIFY

Universal converter for JSON, YAML, XML, and CSV into the TOON format, written in Rust.

Language: Rust - Size: 126 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

NeuroJSON/jsnirf

A JSON/binary JSON extension to the SNIRF format

Size: 84 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 7

deshima-dev/demerge

:truck: DESHIMA merge code for observed datasets

Language: Python - Size: 262 MB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 3 - Forks: 1

kesh-lang/sode

kesh tree structured data format

Size: 93.8 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

nuald/glSatellite-Demo

Technological demo featuring various areas as using OpenGL ES with Android NDK, custom message queues, interacting JNI and Java activities, using NORAD databases and parsing their data.

Language: C++ - Size: 6.21 MB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

twesterhout/hdf5-hs

High-level Haskell bindings to HDF5

Language: Haskell - Size: 170 KB - Last synced at: 7 months ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

robianchini/react-data-formatter

A react-data-formatter é uma biblioteca em JavaScript para formatação de dados brasileiros como CPF, CNPJ, CEP, telefone, moeda, placa e gênero.

Language: JavaScript - Size: 616 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

ALANVF/reon

REON is an expressive data format based on Red/Rebol syntax that can be converted to/from JSON

Language: CoffeeScript - Size: 176 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 0

antgon/edfrw

A Python library for reading and writing European Data Format (EDF) files

Language: Python - Size: 2.86 MB - Last synced at: 11 days ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

abrudz/parsing

Dyalog APL expressions to parse common and unusual data formats from text files

Language: APL - Size: 3.91 KB - Last synced at: 9 months ago - Pushed at: about 4 years ago - Stars: 3 - Forks: 0

kikones34/dsmap-serial-format

Low-level description of the format in which GameMaker Studio serializes ds_map objects.

Size: 1.95 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 0

DFASDL/dfasdl-utils 📦

Moved: https://codeberg.org/tensei-data/dfasdl-utils

Language: Scala - Size: 705 KB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

matthewdeanmartin/hissbytenotation

Library to make it easy to use python literal syntax as a data format

Language: Python - Size: 211 KB - Last synced at: 9 days ago - Pushed at: 11 days ago - Stars: 2 - Forks: 0

poseidon-framework/poseidon-schema

An archaeogenetic genotype data organisation file format

Size: 1.03 MB - Last synced at: 14 days ago - Pushed at: 15 days ago - Stars: 2 - Forks: 2

rhinoceros7/flash

Flash is a C library and CLI for append-only, verifiable event streams stored in ".flsh" files.

Language: C - Size: 203 KB - Last synced at: 17 days ago - Pushed at: 21 days ago - Stars: 2 - Forks: 0

iamgerwin/toon-php

A lightweight, fast TOON (Token-Oriented Object Notation) library for PHP. Optimized for LLM contexts. PHP 7.0-8.0 [legacy] and 8.1 and up [modern] support.

Language: PHP - Size: 96.7 KB - Last synced at: 12 days ago - Pushed at: 24 days ago - Stars: 2 - Forks: 0

biomarkersParkinson/tsdf

A package to read, modify and write TSDF data in Python.

Language: Python - Size: 4.5 MB - Last synced at: 27 days ago - Pushed at: 29 days ago - Stars: 2 - Forks: 0

fezcode/piml

Parenthesis Intended Markup Language

Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

fezcode/piml.js

PIML (Parenthesis Intended Markup Language) encoder/decoder for Javascript

Language: JavaScript - Size: 42 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

fezcode/go-piml

PIML library for go

Language: Go - Size: 17.6 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 2 - Forks: 0

Cactus-minecraft-server/nbt

[CactusMC] Module component responsible for NBT parsing.

Language: Rust - Size: 13.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

deshima-dev/dems

:truck: DESHIMA Measurement Set

Language: Python - Size: 180 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 1

zarr-developers/blog

Zarr Official Blog

Language: Ruby - Size: 436 KB - Last synced at: 8 months ago - Pushed at: 9 months ago - Stars: 2 - Forks: 7

digitallinguistics/DFT

Discourse Functional Transcription

Size: 23.4 KB - Last synced at: 9 months ago - Pushed at: almost 6 years ago - Stars: 2 - Forks: 1

tomhodgins/htmlforever

HTML templating helper functions written in a Continuation-Passing Style for JavaScript, Python, and Ruby

Language: Python - Size: 21.5 KB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 1

CHollasch/LSON4J 📦

Java LSON parser.

Language: Java - Size: 697 KB - Last synced at: 9 months ago - Pushed at: over 7 years ago - Stars: 2 - Forks: 0

ready-steady/mat

Reader and writer of MATLAB MAT-files

Language: Go - Size: 602 KB - Last synced at: 3 months ago - Pushed at: almost 11 years ago - Stars: 2 - Forks: 0

MiraGeoscience/las-geoh5

Import/Export LAS files to/from GEOH5 format for geoscientific data

Language: Python - Size: 3.37 MB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 1

TangYewLabs/corpus-informaticus

CIVD — Corpus-Informaticus Volumetric Data Experimental 3D container + ROI engine for robotics/AI payloads (maps, logs, sensor grids) with capsule metadata.

Language: Python - Size: 21.3 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 1 - Forks: 0

naufalhanif25/hron-format

HRON (Hierarchical Reference Object Notation) is a structured text format focused on being readable, compact, and fast to parse.

Language: TypeScript - Size: 348 KB - Last synced at: 18 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

devcom33/json-toon-converter

Convert JSON to TOON format and back - reduce LLM token usage by up to 40%.

Language: JavaScript - Size: 894 KB - Last synced at: 28 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

DigitalCoreHub/laravel-toon

A friendly JSON transformer for Laravel — Convert JSON ↔ TOON, an ultra-minimal, line-based data format.

Language: PHP - Size: 118 KB - Last synced at: 12 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

supervisely-ecosystem/convert-yolov5-to-supervisely-format

YOLOv5 to Supervisely format

Language: Python - Size: 2.19 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 2

vimalnathnambiar/exfilms

A command-line interface tool to extract, filter, and standardise MS data.

Language: JavaScript - Size: 80 MB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Amnicastro98/JSON-Reference

Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

peterkuma/pst

Plain Structured Text (PST)

Language: Python - Size: 110 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

trilobite-stdlib/trilo-xfile-c

That format framework from the laboratory. Basically, the data format handling functionality for JSON, CSV, XML, and more written in C.

Language: C - Size: 79.1 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

AvailLang/avail-json

Read and write freeform JSON with precise error checking.

Language: Kotlin - Size: 160 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

AmateurPanda92/game-data-schema

🗂️ A TypeScript data model and JSON schema for encapsulating game data, with a focus on flexibility and human readability.

Size: 2.93 KB - Last synced at: over 2 years ago - Pushed at: about 3 years ago - Stars: 1 - Forks: 0

Naira0/datax

a custom data format parser.

Language: C++ - Size: 6.84 KB - Last synced at: almost 3 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0

andrevdl/SmartIO

Transforms common data formats from one type to another, such as JSON, XML and datasets

Language: C++ - Size: 220 KB - Last synced at: over 2 years ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 0