GitHub / Unstructured-IO 29 Repositories
Unstructured-IO/docs
Documentation for all Unstructured products and libraries
Language: MDX - Size: 25.4 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 6 - Forks: 22

Unstructured-IO/unstructured-js-client
A JavaScript/Typescript client for the Unstructured Platform API
Language: TypeScript - Size: 5.22 MB - Last synced at: 2 days ago - Pushed at: 17 days ago - Stars: 51 - Forks: 15

Unstructured-IO/unstructured-platform-plugins
Language: Python - Size: 184 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 4 - Forks: 1

Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Language: HTML - Size: 192 MB - Last synced at: 6 days ago - Pushed at: 17 days ago - Stars: 10,915 - Forks: 907

Unstructured-IO/unstructured-ingest
Language: HTML - Size: 57.2 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 83 - Forks: 39

Unstructured-IO/unstructured-inference
Language: Python - Size: 31.8 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 178 - Forks: 59

Unstructured-IO/unstructured-api
Language: Python - Size: 38.8 MB - Last synced at: 11 days ago - Pushed at: 2 months ago - Stars: 708 - Forks: 159

Unstructured-IO/unstructured-python-client
A Python client for the Unstructured Platform API
Language: Python - Size: 7.03 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 99 - Forks: 16

Unstructured-IO/pipeline-paddleocr
Pipeline for converting PDFs to raw text with PaddleOCR
Language: Jupyter Notebook - Size: 6.55 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 7

Unstructured-IO/UNS-MCP
Language: Python - Size: 266 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 12 - Forks: 1

Unstructured-IO/base-images
Store Dockerfiles and Packer configs for images to use as a base to build upon
Language: Shell - Size: 3.99 MB - Last synced at: 18 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 2

Unstructured-IO/.github
Size: 123 KB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 2

Unstructured-IO/irs-manual-demo
Language: Python - Size: 63.8 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 15 - Forks: 7

Unstructured-IO/unstructured.PaddleOCR Fork of PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Language: Python - Size: 357 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 33 - Forks: 4

Unstructured-IO/unstructured.pytesseract Fork of madmaze/pytesseract
A Python wrapper for Google Tesseract
Language: Python - Size: 1.46 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 1

Unstructured-IO/danswer Fork of onyx-dot-app/onyx
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
Language: Python - Size: 7.91 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 10 - Forks: 2

Unstructured-IO/pipeline-sec-filings 📦
Preprocessing pipeline notebooks and API supporting text extraction from SEC documents
Language: Jupyter Notebook - Size: 1.31 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 143 - Forks: 31

Unstructured-IO/azure-ai-hub-gateway-solution-accelerator Fork of Azure-Samples/ai-hub-gateway-solution-accelerator
Reference architecture that provides a set of guidelines and best practices for implementing a central AI API gateway to empower various line-of-business units in an organization to leverage Azure AI services
Size: 5.78 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Unstructured-IO/model-cards
FedRAMP formatted model cards
Size: 37.1 KB - Last synced at: 14 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

Unstructured-IO/aws-blog-post-example
Script to accompany the AWS blog post on unstructured data ETL with Unstructured Ingest library
Language: Python - Size: 8.79 KB - Last synced at: 14 days ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Unstructured-IO/pipeline-receipts
Preprocessing pipeline notebooks and API supporting text extraction from receipts images
Language: Jupyter Notebook - Size: 1.39 MB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 2

Unstructured-IO/js-client-batch
JS Client Batch Processing
Language: JavaScript - Size: 98.4 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Unstructured-IO/danswer-unstructured Fork of danswer-ai/danswer
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
Size: 7.21 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

Unstructured-IO/wolfi-dev-os Fork of wolfi-dev/os
Main package repository for production Wolfi images
Size: 64.9 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

Unstructured-IO/pipeline-oer
Pipeline for extraction information from Army OERs
Language: Jupyter Notebook - Size: 3.59 MB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 5

Unstructured-IO/community 📦
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Size: 5.7 MB - Last synced at: 11 months ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 6

Unstructured-IO/terraform-aws-ecs-web-app Fork of cloudposse/terraform-aws-ecs-web-app
Terraform module that implements a web app on ECS and supports autoscaling, CI/CD, monitoring, ALB integration, and much more.
Language: HCL - Size: 437 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

Unstructured-IO/unstructured.Paddle Fork of PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
Size: 332 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 0

Unstructured-IO/langchain Fork of langchain-ai/langchain
⚡ Building applications with LLMs through composability ⚡
Language: Python - Size: 53.4 MB - Last synced at: about 1 year ago - Pushed at: over 1 year ago - Stars: 9 - Forks: 1

Unstructured-IO/unstructured-api-tools 📦
Language: Python - Size: 628 KB - Last synced at: 8 days ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 11

Unstructured-IO/pipeline-template
Language: Python - Size: 149 KB - Last synced at: 18 days ago - Pushed at: over 1 year ago - Stars: 8 - Forks: 8

Unstructured-IO/super-gradients-fork Fork of Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Language: Jupyter Notebook - Size: 251 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

Unstructured-IO/langchainjs Fork of langchain-ai/langchainjs
Language: TypeScript - Size: 7.45 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 1

Unstructured-IO/pipeline-document-layout
Pipeline for layout extraction
Language: Python - Size: 1.6 MB - Last synced at: 14 days ago - Pushed at: almost 2 years ago - Stars: 1 - Forks: 2

Unstructured-IO/chat-isw-reports Fork of hwchase17/chat-your-data
Language: Python - Size: 23.9 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 5 - Forks: 1

Unstructured-IO/pipeline-invoices
Language: Jupyter Notebook - Size: 4.67 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 4 - Forks: 3

Unstructured-IO/terraform-aws-ecs-alb-service-task Fork of cloudposse/terraform-aws-ecs-alb-service-task
Terraform module which implements an ECS service which exposes a web service via ALB.
Language: HCL - Size: 352 KB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

Unstructured-IO/prometheus-community-helm-charts Fork of prometheus-community/helm-charts
Prometheus community Helm charts
Language: Mustache - Size: 6.71 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1
