GitHub topics: chunking-algorithm
chonkie-inc/chonkie-ts
π¦ CHONK your texts with Chonkie β¨ Type-friendly, light-weight, fast and super-simple chunking library
Language: TypeScript - Size: 652 KB - Last synced at: 2 days ago - Pushed at: 21 days ago - Stars: 260 - Forks: 9

chonkie-inc/chonkie
π¦ CHONK your texts with Chonkie β¨ β The no-nonsense RAG chunking library
Language: Python - Size: 6.24 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,939 - Forks: 102

Haruno19/MST-Semantic-Chunker
A new, experimental text chunking method based on Minimum Spanning Tree clustering with a hybrid semantical-positional distance measure.
Language: Python - Size: 33.2 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

iscc/fastcdc-py
FastCDC implementation in Python https://pypi.org/project/fastcdc/
Language: Python - Size: 339 KB - Last synced at: 11 days ago - Pushed at: about 1 year ago - Stars: 58 - Forks: 17

nlfiedler/fastcdc-rs
FastCDC implementation in Rust
Language: Rust - Size: 257 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 158 - Forks: 28

FastPix/android-uploads-sdk
Android Resumable Uploads SDK from Fastpix
Language: Kotlin - Size: 74.2 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

D-X-W-Clerker/clerker-ai
[2024-2] Mermaid λͺ¨λΈμ νμ©ν νμ μ§μ νλ«νΌ μλΉμ€ "Clerker"
Language: Python - Size: 28.8 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1

isaka-james/chunks-to-file
A nodejs chunking system
Language: JavaScript - Size: 55.7 KB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

mg98/ae-chunker-go
Go implementation of the AE chunking algorithm.
Language: Go - Size: 83 KB - Last synced at: 4 months ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 0

davidwrossiter/langchunk
Source code for chunking code in multiple different languages
Language: JavaScript - Size: 6.28 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

i5heu/ChunkingChampions
Explore and benchmark the world of data chunking algorithms in 'ChunkingChampions' - a competitive arena to determine the most efficient and effective chunking strategies for varied data sizes.
Size: 564 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mudssrali/chunkify
a simple utility to split given array into chunks of input size with array reverse option
Language: TypeScript - Size: 105 KB - Last synced at: 6 days ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0
