GitHub / yobix-ai / extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/yobix-ai%2Fextractous
PURL: pkg:github/yobix-ai/extractous
Stars: 1,217
Forks: 56
Open issues: 23
License: Apache-2.0
Language: Rust
Size: 2.88 MB
Dependencies parsed at: Pending
Created at: over 1 year ago
Updated at: 12 days ago
Pushed at: 9 months ago
Last synced at: 11 days ago
Topics: data-pipelines, docx, etl, etl-pipelines, extraction, llm, machine-learning, natural-language-processing, nlp, ocr, pdf, pdf-parser, rag, rust, tika, unstructured, unstructured-data