An open API service providing repository metadata for many open source software ecosystems.

Package Usage: pypi: unstructured-api-tools

A library that prepares raw documents for downstream ML tasks.
32 versions
Latest release: almost 2 years ago
731 downloads last month

View more package details: https://packages.ecosyste.ms/registries/pypi.org/packages/unstructured-api-tools

View more repository details: http://repos.ecosyste.ms/hosts/GitHub/repositories/Unstructured-IO%2Funstructured-api-tools

Dependent Repos 2

Unstructured-IO/pipeline-sec-filings 📦
Preprocessing pipeline notebooks and API supporting text extraction from SEC documents

Size: 1.31 MB - Last synced: 14 days ago - Pushed: over 1 year ago

Unstructured-IO/pipeline-oer
Pipeline for extraction information from Army OERs

Size: 3.59 MB - Last synced: 14 days ago - Pushed: over 1 year ago

Unstructured-IO/pipeline-paddleocr
Pipeline for converting PDFs to raw text with PaddleOCR

Size: 6.55 MB - Last synced: 14 days ago - Pushed: over 1 year ago

Pablongo24/langchain-test

Size: 492 KB - Last synced: 1 day ago - Pushed: over 2 years ago

theZaX/unstructured-api Fork of Unstructured-IO/unstructured-api

Size: 5.43 MB - Last synced: over 1 year ago - Pushed: over 1 year ago

j-chacko/Unstructured
Created a Python script that uses Unstructured.io's Open Source Library to process files.

Size: 15.6 KB - Last synced: 4 months ago - Pushed: 4 months ago