GitHub topics: document-ai
athallahaiqal/document-ai
A simple FastAPI application that allows users to upload PDF or DOCX documents in a database, get a summary generated by a local LLM via Ollama, and ask natural language questions about their content.
Language: Python - Size: 63.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Language: Python - Size: 73.6 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 21,320 - Forks: 2,626

smartloop-ai/smartloop
Smartloop is an open-source SLM platform to train and run models on an edge device
Language: Python - Size: 106 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

qyhou/curated-document-layout-analysis
A curated list of resources on Document Layout Analysis
Size: 10.7 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

akincenk/insightai
AI-powered semantic PDF intelligence engine. Upload PDFs, ask natural questions, get meaningful answers from inside documents using vector search and OpenAI.
Language: Python - Size: 14.6 KB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

deepdoctection/deepdoctection
A Repo For Document AI
Language: Python - Size: 27.9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2,836 - Forks: 159

nttmdlab-nlp/VDocRAG
[CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents
Language: Python - Size: 4.06 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 19 - Forks: 3

AntraTripathi74/Document-AI-Summarizer
Summarise your documents in any language using Document AI
Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python - Size: 61.3 MB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 6,249 - Forks: 511

wintermi/ocr-runner
OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.
Language: Go - Size: 297 KB - Last synced at: 5 days ago - Pushed at: 21 days ago - Stars: 4 - Forks: 1

HimanshuMohanty-Git24/KhataGPT
Transform how you interact with documents! Simply upload receipts, invoices, or forms and instantly chat with them. Get answers, extract key information, and save hours of manual work. Your personal document assistant that understands what matters to you.
Language: JavaScript - Size: 3.68 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

zachurban/HousingMind
A curated training dataset for fine-tuning large language models on U.S. affordable housing policy, finance, public housing, LIHTC, regulations, and voucher program administration. Designed for compliance automation, technical assistance, and intelligent document generation in pursuit of affordable housing development and preservation.
Language: HTML - Size: 515 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

qyhou/curated-table-structure-recognition
A curated list of resources on Table Structure Recognition
Size: 88.9 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 7 - Forks: 1

tstanislawek/awesome-document-understanding
A curated list of resources for Document Understanding (DU) topic
Size: 5.56 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,405 - Forks: 160

googleapis/python-documentai-toolbox
Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.
Language: Python - Size: 17.6 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 43 - Forks: 18

ZeningLin/PEneo
[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.
Language: Python - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 29 - Forks: 7

ozcanmiraay/opsbot
AI-powered PDF extraction suite for structured insights from contracts, forms, and documents. Built with Streamlit, LangChain, GPT-4o, and PDFPlumber.
Language: Python - Size: 9.61 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nttmdlab-nlp/SlideVQA
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)
Language: Python - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 87 - Forks: 8

doc-analysis/ReadingBank
ReadingBank: A Benchmark Dataset for Reading Order Detection
Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 104 - Forks: 3

SCUT-DLVCLab/Document-AI-Recommendations
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
Size: 7.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 181 - Forks: 7

ZeningLin/ViBERTgrid-PyTorch
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Language: Python - Size: 388 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 5

coderhema/doci
Document Interpreter or DOCI for short is an ai document scanner and reader that uses ai to explain parts of a legal documentation
Language: Dart - Size: 338 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sanikamal/gcp-ai-projects
Explore and implement powerful AI and Machine Learning solutions using Google Cloud Platform (GCP).
Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SCUT-DLVCLab/RFUND
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
Size: 723 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 0

IonMich/batch-doc-vqa
Ask a question about a document collection and extract structured responses
Language: Python - Size: 3.42 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

whn09/table_structure_recognition
Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.
Language: Jupyter Notebook - Size: 167 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 45 - Forks: 14

clovaai/webvicob
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
Language: Python - Size: 16.6 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 104 - Forks: 6

samprietoserrano/fraktur-ocr-transcription
Transcription project consisting of Python scripting and usage of ML text extraction models.
Language: Python - Size: 44.7 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

arakattack/ocr-transcript
This Flask application Google Cloud Document AI to extract name, IPK (GPA), university details, etc.
Language: Python - Size: 18.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DunnBC22/Vision_Audio_and_Multimodal_Projects
This repository includes all computer vision, audio, document AI, and multimodal projects.
Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 5

Unstructured-IO/community 📦
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Size: 5.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 6

Purushothaman-natarajan/Custom-NER-Model-using-Spacy-Fine-Tuning
Spacy for Key:Value pairs
Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

conditionedstimulus/DocumentClassifier
FastAPI application for document classification using a multimodal LayoutLM model, designed to classify PDF documents into RVL-DCIP categories.
Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

OleksiiLatypov/Google_Cloud
AI & Data, Google Cloud Skills Boost
Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bwnyasse/dart-documentai-samples
A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's DocumentAI.
Language: Dart - Size: 605 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

NirmalNagaraj/DocGPT
A Chatbot for the Document Analysis .
Language: Python - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

chenxn2020/GOSE
Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document""
Language: Python - Size: 11.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

ricardolsmendes/gcp-documentai-custom-extractors
Custom data extractors that use Google Cloud's Document AI
Size: 28.9 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jpWang/LiLT
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Language: Python - Size: 1.36 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 282 - Forks: 34

marcusmonteirodesouza/google-cloud-document-ai-rest-api-demo
Create an Identity Auto-Filler API with Google Cloud Document AI
Language: TypeScript - Size: 76.4 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

samkenxstream/SamKenX_documents-ai Fork of GoogleCloudPlatform/document-ai-samples
SamKenX applications and Document AI, the end-to-end document processing platform on Cloudstorage warehouse.
Language: Python - Size: 98.4 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

masoudshab/Doc2Edi
Extracting Data from Document PDF and Converting to EDI211 Files Using GCP and Google Document AI
Language: Python - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

dhorvay/document-understanding-ebook
(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨
Language: Markdown - Size: 8.88 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

bhadreshpsavani/SmartOCR-with-LayoutLM
Exploring LayoutLM for Smart OCR Capabilities
Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0
