An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: document-ai

athallahaiqal/document-ai

A simple FastAPI application that allows users to upload PDF or DOCX documents in a database, get a summary generated by a local LLM via Ollama, and ask natural language questions about their content.

Language: Python - Size: 63.5 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Language: Python - Size: 73.6 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 21,320 - Forks: 2,626

smartloop-ai/smartloop

Smartloop is an open-source SLM platform to train and run models on an edge device

Language: Python - Size: 106 KB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 2 - Forks: 0

qyhou/curated-document-layout-analysis

A curated list of resources on Document Layout Analysis

Size: 10.7 KB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

akincenk/insightai

AI-powered semantic PDF intelligence engine. Upload PDFs, ask natural questions, get meaningful answers from inside documents using vector search and OpenAI.

Language: Python - Size: 14.6 KB - Last synced at: 5 days ago - Pushed at: 13 days ago - Stars: 0 - Forks: 0

deepdoctection/deepdoctection

A Repo For Document AI

Language: Python - Size: 27.9 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 2,836 - Forks: 159

nttmdlab-nlp/VDocRAG

[CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents

Language: Python - Size: 4.06 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 19 - Forks: 3

AntraTripathi74/Document-AI-Summarizer

Summarise your documents in any language using Document AI

Language: Jupyter Notebook - Size: 1.57 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 0 - Forks: 0

clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Language: Python - Size: 61.3 MB - Last synced at: 20 days ago - Pushed at: 11 months ago - Stars: 6,249 - Forks: 511

wintermi/ocr-runner

OCR Runner - Command Line Application for processing image files using Google Cloud Vision API and Google Cloud Document AI.

Language: Go - Size: 297 KB - Last synced at: 5 days ago - Pushed at: 21 days ago - Stars: 4 - Forks: 1

HimanshuMohanty-Git24/KhataGPT

Transform how you interact with documents! Simply upload receipts, invoices, or forms and instantly chat with them. Get answers, extract key information, and save hours of manual work. Your personal document assistant that understands what matters to you.

Language: JavaScript - Size: 3.68 MB - Last synced at: 23 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

zachurban/HousingMind

A curated training dataset for fine-tuning large language models on U.S. affordable housing policy, finance, public housing, LIHTC, regulations, and voucher program administration. Designed for compliance automation, technical assistance, and intelligent document generation in pursuit of affordable housing development and preservation.

Language: HTML - Size: 515 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0

qyhou/curated-table-structure-recognition

A curated list of resources on Table Structure Recognition

Size: 88.9 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 7 - Forks: 1

tstanislawek/awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

Size: 5.56 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 1,405 - Forks: 160

googleapis/python-documentai-toolbox

Document AI Toolbox is an SDK for Python that provides utility functions for managing, manipulating, and extracting information from the document response. It creates a "wrapped" document object from JSON files in Cloud Storage, local JSON files, or output directly from the Document AI API.

Language: Python - Size: 17.6 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 43 - Forks: 18

ZeningLin/PEneo

[MM'2024] PEneo, an effective algorithm for key-value pair extraction from form-like documents, designed for real-world applications.

Language: Python - Size: 10.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 29 - Forks: 7

ozcanmiraay/opsbot

AI-powered PDF extraction suite for structured insights from contracts, forms, and documents. Built with Streamlit, LangChain, GPT-4o, and PDFPlumber.

Language: Python - Size: 9.61 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

nttmdlab-nlp/SlideVQA

SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)

Language: Python - Size: 14.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 87 - Forks: 8

doc-analysis/ReadingBank

ReadingBank: A Benchmark Dataset for Reading Order Detection

Size: 1.21 MB - Last synced at: 3 months ago - Pushed at: 10 months ago - Stars: 104 - Forks: 3

SCUT-DLVCLab/Document-AI-Recommendations

Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

Size: 7.24 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 181 - Forks: 7

ZeningLin/ViBERTgrid-PyTorch

An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"

Language: Python - Size: 388 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 54 - Forks: 5

coderhema/doci

Document Interpreter or DOCI for short is an ai document scanner and reader that uses ai to explain parts of a legal documentation

Language: Dart - Size: 338 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sanikamal/gcp-ai-projects

Explore and implement powerful AI and Machine Learning solutions using Google Cloud Platform (GCP).

Language: Jupyter Notebook - Size: 7.19 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SCUT-DLVCLab/RFUND

[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

Size: 723 KB - Last synced at: 3 months ago - Pushed at: 6 months ago - Stars: 19 - Forks: 0

IonMich/batch-doc-vqa

Ask a question about a document collection and extract structured responses

Language: Python - Size: 3.42 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

whn09/table_structure_recognition

Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, and you can get the same (even better) result compared with Table Transformer (TATR) with smaller models.

Language: Jupyter Notebook - Size: 167 MB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 45 - Forks: 14

clovaai/webvicob

Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023

Language: Python - Size: 16.6 MB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 104 - Forks: 6

samprietoserrano/fraktur-ocr-transcription

Transcription project consisting of Python scripting and usage of ML text extraction models.

Language: Python - Size: 44.7 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

arakattack/ocr-transcript

This Flask application Google Cloud Document AI to extract name, IPK (GPA), university details, etc.

Language: Python - Size: 18.6 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

DunnBC22/Vision_Audio_and_Multimodal_Projects

This repository includes all computer vision, audio, document AI, and multimodal projects.

Language: Jupyter Notebook - Size: 108 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 28 - Forks: 5

Unstructured-IO/community 📦

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Size: 5.7 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 19 - Forks: 6

Purushothaman-natarajan/Custom-NER-Model-using-Spacy-Fine-Tuning

Spacy for Key:Value pairs

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

conditionedstimulus/DocumentClassifier

FastAPI application for document classification using a multimodal LayoutLM model, designed to classify PDF documents into RVL-DCIP categories.

Language: Jupyter Notebook - Size: 1.6 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

OleksiiLatypov/Google_Cloud

AI & Data, Google Cloud Skills Boost

Language: Jupyter Notebook - Size: 3.67 MB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

bwnyasse/dart-documentai-samples

A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's DocumentAI.

Language: Dart - Size: 605 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

NirmalNagaraj/DocGPT

A Chatbot for the Document Analysis .

Language: Python - Size: 10.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

chenxn2020/GOSE

Code for the EMNLP2023 (Findings) paper "Global Structure Knowledge-Guided Relation Extraction Method for Visually-Rich Document""

Language: Python - Size: 11.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 0

ricardolsmendes/gcp-documentai-custom-extractors

Custom data extractors that use Google Cloud's Document AI

Size: 28.9 MB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jpWang/LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Language: Python - Size: 1.36 MB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 282 - Forks: 34

marcusmonteirodesouza/google-cloud-document-ai-rest-api-demo

Create an Identity Auto-Filler API with Google Cloud Document AI

Language: TypeScript - Size: 76.4 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

samkenxstream/SamKenX_documents-ai Fork of GoogleCloudPlatform/document-ai-samples

SamKenX applications and Document AI, the end-to-end document processing platform on Cloudstorage warehouse.

Language: Python - Size: 98.4 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 1

masoudshab/Doc2Edi

Extracting Data from Document PDF and Converting to EDI211 Files Using GCP and Google Document AI

Language: Python - Size: 71.3 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

dhorvay/document-understanding-ebook

(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨

Language: Markdown - Size: 8.88 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 2 - Forks: 0

bhadreshpsavani/SmartOCR-with-LayoutLM

Exploring LayoutLM for Smart OCR Capabilities

Size: 1000 Bytes - Last synced at: over 2 years ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

Related Keywords
document-ai 44 ocr 14 nlp 13 document-understanding 9 machine-learning 6 document-intelligence 5 computer-vision 5 key-information-extraction 5 google-cloud 5 python 5 ai 4 google-cloud-platform 4 visual-information-extraction 4 table-structure-recognition 3 gcp 3 document-analysis 3 information-extraction 3 document-layout-analysis 3 natural-language-processing 3 llm 3 layoutlm 3 chatbot 3 multimodal-pre-trained-model 2 gemini 2 pytorch 2 table-recognition 2 table-detection 2 ocr-python 2 semantic-search 2 pdf 2 openai 2 fastapi 2 deep-learning 2 vertex-ai 2 streamlit 2 document-image-analysis 2 function-calling 1 object-detection 1 automation 1 multimodal-deep-learning 1 audio-classification 1 contracts 1 gpt-4o 1 langchain 1 pdf-extraction 1 transcription 1 text-processing 1 spellchecker 1 research-project 1 python-script 1 german 1 scanner 1 imagens 1 structured-data 1 aaai2023 1 fraktur 1 digital-humanities 1 icdar2023 1 yolov8 1 yolov5 1 table 1 ollama 1 local-llm 1 llama 1 docvqa 1 recommendation-system 1 rag 1 dartlang 1 samples 1 relation-extraction 1 data-extraction 1 multilingual-models 1 express 1 nextjs 1 nodejs 1 terraform 1 api 1 attributor 1 iacknowledgements 1 ip 1 warehouse-management-system 1 google 1 google-document-ai 1 awesome-document-understanding 1 ebook 1 document-inteligence 1 optical-character-recognition 1 transfer-learning 1 transformers 1 community 1 data-pipeline 1 document-parsing 1 nlp-parsing 1 open-source 1 preprocessing-data 1 code 1 ner 1 neural-network 1 nlp-keywords-extraction 1 spacy 1