An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: smoldocling

genieincodebottle/parsemypdf

Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

Language: Python - Size: 3.01 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 111 - Forks: 27

bujo-eayn/agenticAI_pipeline

A modular multi-agent AI system that performs deep scientific research using a supervisor-worker architecture. It combines foundational and specialized language models to reason, plan, and execute tasks for document and chart analysis in scientific domains.

Language: Python - Size: 7.87 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

PRITHIVSAKTHIUR/Multimodal-OCR2

A comprehensive multimodal OCR application that supports both image and video document processing using state-of-the-art vision-language models. This application provides an intuitive Gradio interface for extracting text, converting documents to markdown, and performing advanced document analysis.

Language: Python - Size: 3.03 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 4 - Forks: 0

SamuelGA0211/Multimodal-OCR2

Transform images and videos into text with Multimodal-OCR2. Enjoy advanced document analysis and conversion through a user-friendly interface. 🌟📂

Language: Python - Size: 2.78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0