An open API service providing repository metadata for many open source software ecosystems.

Topic: "document-image-analysis"

Unstructured-IO/unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Language: HTML - Size: 192 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 11,009 - Forks: 917

deepdoctection/deepdoctection

A Repo For Document AI

Language: Python - Size: 21.8 MB - Last synced at: 10 days ago - Pushed at: 23 days ago - Stars: 2,796 - Forks: 154

enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

Language: Python - Size: 20.3 MB - Last synced at: 9 days ago - Pushed at: 9 days ago - Stars: 1,205 - Forks: 118

hpanwar08/detectron2 (fork of facebookresearch/detectron2)

Detectron2 for Document Layout Analysis

Language: Python - Size: 4.53 MB - Last synced at: about 1 year ago - Pushed at: over 4 years ago - Stars: 178 - Forks: 62

chulwoopack/docstrum

Language: Jupyter Notebook - Size: 97.1 MB - Last synced at: about 1 month ago - Pushed at: about 7 years ago - Stars: 69 - Forks: 21

huyhoang17/kuzushiji_recognition

[Late Submission] Solution for Kuzushiji recognition (Kaggle competition)

Language: Python - Size: 90 MB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 17 - Forks: 2

chulwoopack/gravity-map

Visual Domain Knowledge-based Multimodal Zoning for Textual Region Localization in Noisy Historical Document Images

Language: C++ - Size: 1.2 GB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 0

chulwoopack/document_complexity

Analyze document image complexity based on segmentation results

Language: Python - Size: 2.93 KB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 1 - Forks: 0

athallahaiqal/document-ai

A simple FastAPI application that lets users upload PDF or DOCX documents to a database, get a summary generated by a local LLM via Ollama, and ask natural language questions about their content.

Language: Python - Size: 64.5 KB - Last synced at: about 4 hours ago - Pushed at: about 5 hours ago - Stars: 0 - Forks: 0

chulwoopack/voronoi_based_docu_complexity_analysis

Language: Jupyter Notebook - Size: 267 KB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0

chulwoopack/Mask_RCNN_SegDog

Language: Jupyter Notebook - Size: 609 MB - Last synced at: almost 2 years ago - Pushed at: about 6 years ago - Stars: 0 - Forks: 0