An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: chunking-algorithm

chonkie-inc/chonkie-ts

πŸ¦› CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library

Language: TypeScript - Size: 652 KB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 260 - Forks: 9

chonkie-inc/chonkie

πŸ¦› CHONK your texts with Chonkie ✨ β€” The no-nonsense RAG chunking library

Language: Python - Size: 6.24 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,939 - Forks: 102

Haruno19/MST-Semantic-Chunker

A new, experimental text chunking method based on Minimum Spanning Tree clustering with a hybrid semantical-positional distance measure.

Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

iscc/fastcdc-py

FastCDC implementation in Python https://pypi.org/project/fastcdc/

Language: Python - Size: 339 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 17

nlfiedler/fastcdc-rs

FastCDC implementation in Rust

Language: Rust - Size: 257 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 158 - Forks: 28

FastPix/android-uploads-sdk

Android Resumable Uploads SDK from Fastpix

Language: Kotlin - Size: 74.2 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

D-X-W-Clerker/clerker-ai

[2024-2] Mermaid λͺ¨λΈμ„ ν™œμš©ν•œ 회의 지원 ν”Œλž«νΌ μ„œλΉ„μŠ€ "Clerker"

Language: Python - Size: 28.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

isaka-james/chunks-to-file

A nodejs chunking system

Language: JavaScript - Size: 55.7 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

mg98/ae-chunker-go

Go implementation of the AE chunking algorithm.

Language: Go - Size: 83 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

davidwrossiter/langchunk

Source code for chunking code in multiple different languages

Language: JavaScript - Size: 6.28 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

i5heu/ChunkingChampions

Explore and benchmark the world of data chunking algorithms in 'ChunkingChampions' - a competitive arena to determine the most efficient and effective chunking strategies for varied data sizes.

Size: 564 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mudssrali/chunkify

a simple utility to split given array into chunks of input size with array reverse option

Language: TypeScript - Size: 105 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0