An open API service providing repository metadata for many open source software ecosystems.

Topic: "docling"

shoryasethia/markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.

Language: Python - Size: 158 KB - Last synced at: 24 days ago - Pushed at: about 1 month ago - Stars: 84 - Forks: 3

genieincodebottle/parsemypdf

Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

Language: Python - Size: 2.75 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 61 - Forks: 20

fahdmirza/doclingwithollama

Docling with Ollama - RAG on Local Files with Local Models

Language: Python - Size: 880 KB - Last synced at: 13 days ago - Pushed at: 4 months ago - Stars: 57 - Forks: 15

ghodsizadeh/pdf2csv

A python library and CLI tool to convert PDF files to CSV files.

Language: Python - Size: 475 KB - Last synced at: 14 days ago - Pushed at: 4 months ago - Stars: 26 - Forks: 1

versionHQ/multi-agent-system

Autonomous agent networks for task automation that requires multi-step reasoning

Language: Python - Size: 3.58 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 15 - Forks: 3

garyzava/chat-to-database-chatbot

Chat to your Database GenAI Chatbot

Language: Jupyter Notebook - Size: 24.5 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 2

felixdittrich92/docling-OCR-OnnxTR

OnnxTR OCR plugin for Docling

Language: Python - Size: 1.47 MB - Last synced at: 6 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

hemanthkt/impactoverse-AI-mentor

Developed an intelligent AI chatbot utilizing the DeepSeek LLM, designed for efficient interaction with large documents such as textbooks and study materials. Integrated Docling for parsing and processing large files, and implemented a Retrieval-Augmented Generation (RAG) pipeline using FAISS and Sentence Transformers to optimize context retrieval

Language: JavaScript - Size: 593 KB - Last synced at: 10 days ago - Pushed at: 19 days ago - Stars: 1 - Forks: 0

Jarus77/markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.

Language: Python - Size: 85.9 KB - Last synced at: 17 days ago - Pushed at: 22 days ago - Stars: 1 - Forks: 0

maciekmalachowski/CVWizard

🧙‍♂️AI-powered tool to optimize your CV with job-specific keywords and align it to your dream job.

Language: TypeScript - Size: 3.65 MB - Last synced at: 24 days ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ParthaPRay/docling_RAG_langchain_colab

This repo contains codes for RAG using docling on colab notebook with langchain, milvus, huggingface embedding model and LLM

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

katagaki/Lingus

PDF and Markdown conversion using Docling and LibreOffice

Language: Python - Size: 81.1 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

Rishang/deep-research

Python SDK for Deep-Research

Language: Python - Size: 196 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0

ramona1999/Contract-Risk-Assessment

This project is an AI-powered Contract Risk Assessment and Legal Assistant designed to analyze legal documents, extract key clauses, assess risks, and provide actionable recommendations. Additionally, a fine-tuned conversational chatbot is integrated for interactive legal Q&A based on contract-specific knowledge.

Language: Jupyter Notebook - Size: 840 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 0 - Forks: 0

thevladdo/rag-backend

Retrieval-Augmented Generation server with Pinecone and OpenAI

Language: HTML - Size: 46.9 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

docling-project/docling4j

Docling4j brings the functionalities of Docling in document understanding to Java® projects

Language: Java - Size: 16.6 KB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 0 - Forks: 0

jmxt3/pdf_to_txt_converter

A Python script that converts PDF files to text using the docling library. This tool is designed to batch process PDF files, making it easy to extract text content from multiple documents at once.

Language: Python - Size: 1.77 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

hyoaru/rag4jiya-process

Agentic RAG-based system with nursing handbooks and transes as knowledge base for my bebiloves

Language: Jupyter Notebook - Size: 13.8 MB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

kwame-mintah/python-langchain-chainlit-qdrant-ollama-stack-template

📄 A template for project for creating a chainlit application, using a locally run model via ollama and qdrant vector database for document retrieval.

Language: Python - Size: 46.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

patw/docinator

A small service to convert PDF files to Markdown using the Docling library

Language: Python - Size: 4.88 KB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

qlfv/Docling-Testing

Repository for testing and demonstrating the capabilities of Docling for document conversion.

Language: HTML - Size: 18.4 MB - Last synced at: 10 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 2

ParthaPRay/gradio_docling_rag_langchain

This repo provide RAG using Docling, langchain, milvus, sentence transformers, huggingface LLMs

Language: Jupyter Notebook - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

ParthaPRay/Docling_Colab

This repo contains google colab notebook for handing Docling for data extraction such as text, image, table etc.

Language: Jupyter Notebook - Size: 697 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

shrimantasatpati/Document_Parser_using_AI

Parse documents using AI - any document converted to markdown suitable for RAG applications

Language: Jupyter Notebook - Size: 12.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0