An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: markitdown

genieincodebottle/parsemypdf

Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

Language: Python - Size: 2.75 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 61 - Forks: 20

innobraingmbh/markitdown

Laravel bindings to microsoft/markitdown

Language: PHP - Size: 107 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 0

pig-mesh/office2md

[Required for large models] Office to Markdown service implementation, based on Microsoft Markitdown.

Language: Python - Size: 27.8 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 23 - Forks: 3

Climactic/markitdown-api

A REST API to Convert Files to Markdown with AI

Language: Python - Size: 24.4 KB - Last synced at: 9 days ago - Pushed at: 28 days ago - Stars: 4 - Forks: 0

Lynixtaxic/docsifer

Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.

Size: 1.95 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 4 - Forks: 0

shoryasethia/markdrop

A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted tables/images using several LLM clients. And many more functionalities. Markdrop is available on PyPI.

Language: Python - Size: 158 KB - Last synced at: 18 days ago - Pushed at: 25 days ago - Stars: 84 - Forks: 3

prakash-aryan/grocery_price_assistant

An intelligent grocery shopping assistant that calculates prices, generates beautiful receipts and answers questions about your grocery database using LLM and Langchain.

Language: Python - Size: 405 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

tolios/XPL

A simple cli tool for RAG on documents

Language: Python - Size: 97.7 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

lh0x00/docsifer

Docsifer is a powerful tool for converting various data formats into Markdown for applications such as indexing, text analysis, and more. It supports PDF, PowerPoint, Word, Excel, Images, Audio, HTML, and other text-based formats, and leverages LLMs to enhance performance.

Language: Python - Size: 150 KB - Last synced at: 23 days ago - Pushed at: about 2 months ago - Stars: 5 - Forks: 0

aromalanil/markItDown

📱 A React app to preview and edit Markdown✍. You can also export it as HTML.

Language: JavaScript - Size: 612 KB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 42 - Forks: 10

lh0x00/embs

embs is a Python toolkit for retrieving documents (via Docsifer), generating embeddings (via Lightweight Embeddings API), and ranking texts with an optional caching system.

Language: Python - Size: 112 KB - Last synced at: 23 days ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

bigbag/cv_matcher

CV Matcher is a Python-based application that helps analyze resumes and match them against job descriptions. It provides both CLI and server-based interfaces for resume analysis.

Language: Python - Size: 264 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

9bow/markitdown-api-fly-io

Simple FastAPI wrapper for Document-to-Markdown conversion using Microsoft's MarkItDown library.

Language: Python - Size: 16.6 KB - Last synced at: about 18 hours ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

LF3551/AutoDocMark

AutoDocMark: Streamline Document-to-Markdown Workflows

Language: Python - Size: 112 KB - Last synced at: 2 days ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

meenbeese/md-convert

Basic wrapper around the MarkItDown library from Microsoft.

Language: Python - Size: 8.79 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0