Topic: "docling"
shoryasethia/markdrop
A Python package for converting PDFs to Markdown while extracting images and tables, and for generating text descriptions of the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
Language: Python - Size: 158 KB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 84 - Forks: 3

genieincodebottle/parsemypdf
A collection of PDF parsing libraries, including AI-based ones (docling, claude, openai, llama-vision, unstructured-io) as well as pdfminer, pymupdf, pdfplumber, etc., for efficient snapshot, text, table, and metadata extraction.
Language: Python - Size: 2.75 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 61 - Forks: 20

fahdmirza/doclingwithollama
Docling with Ollama - RAG on Local Files with Local Models
Language: Python - Size: 880 KB - Last synced at: 13 days ago - Pushed at: 4 months ago - Stars: 57 - Forks: 15

ghodsizadeh/pdf2csv
A Python library and CLI tool to convert PDF files to CSV files.
Language: Python - Size: 475 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 1

versionHQ/multi-agent-system
Autonomous agent networks for task automation that requires multi-step reasoning
Language: Python - Size: 3.58 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 15 - Forks: 3

garyzava/chat-to-database-chatbot
Chat to your Database GenAI Chatbot
Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2

felixdittrich92/docling-OCR-OnnxTR
OnnxTR OCR plugin for Docling
Language: Python - Size: 1.47 MB - Last synced at: 6 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

hemanthkt/impactoverse-AI-mentor
An AI chatbot built on the DeepSeek LLM for efficient interaction with large documents such as textbooks and study materials. It uses Docling to parse and process large files, and a Retrieval-Augmented Generation (RAG) pipeline built on FAISS and Sentence Transformers to optimize context retrieval.
Language: JavaScript - Size: 593 KB - Last synced at: 10 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

Jarus77/markdrop
A Python package for converting PDFs to Markdown while extracting images and tables, and for generating text descriptions of the extracted tables and images using several LLM clients, among other features. Markdrop is available on PyPI.
Language: Python - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

maciekmalachowski/CVWizard
🧙‍♂️ AI-powered tool to optimize your CV with job-specific keywords and align it to your dream job.
Language: TypeScript - Size: 3.65 MB - Last synced at: 24 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ParthaPRay/docling_RAG_langchain_colab
This repo contains code for RAG using Docling in a Colab notebook with LangChain, Milvus, and a Hugging Face embedding model and LLM.
Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

katagaki/Lingus
PDF and Markdown conversion using Docling and LibreOffice
Language: Python - Size: 81.1 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Rishang/deep-research
Python SDK for Deep-Research
Language: Python - Size: 196 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ramona1999/Contract-Risk-Assessment
This project is an AI-powered Contract Risk Assessment and Legal Assistant designed to analyze legal documents, extract key clauses, assess risks, and provide actionable recommendations. Additionally, a fine-tuned conversational chatbot is integrated for interactive legal Q&A based on contract-specific knowledge.
Language: Jupyter Notebook - Size: 840 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

thevladdo/rag-backend
Retrieval-Augmented Generation server with Pinecone and OpenAI
Language: HTML - Size: 46.9 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

docling-project/docling4j
Docling4j brings the functionalities of Docling in document understanding to Java® projects
Language: Java - Size: 16.6 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

jmxt3/pdf_to_txt_converter
A Python script that converts PDF files to text using the docling library. This tool is designed to batch process PDF files, making it easy to extract text content from multiple documents at once.
Language: Python - Size: 1.77 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hyoaru/rag4jiya-process
Agentic RAG-based system with nursing handbooks and transes as knowledge base for my bebiloves
Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

kwame-mintah/python-langchain-chainlit-qdrant-ollama-stack-template
📄 A project template for creating a Chainlit application, using a locally run model via Ollama and the Qdrant vector database for document retrieval.
Language: Python - Size: 46.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

patw/docinator
A small service to convert PDF files to Markdown using the Docling library
Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

qlfv/Docling-Testing
Repository for testing and demonstrating the capabilities of Docling for document conversion.
Language: HTML - Size: 18.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 2

ParthaPRay/gradio_docling_rag_langchain
This repo provides RAG using Docling, LangChain, Milvus, Sentence Transformers, and Hugging Face LLMs.
Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ParthaPRay/Docling_Colab
This repo contains a Google Colab notebook for handling Docling for data extraction such as text, images, tables, etc.
Language: Jupyter Notebook - Size: 697 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shrimantasatpati/Document_Parser_using_AI
Parse documents using AI - any document converted to markdown suitable for RAG applications
Language: Jupyter Notebook - Size: 12.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0
