An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: mistral-ocr

Datalore-ai/datalore-localgen-cli

synthetic dataset generation workflow using local file resources for finetuning llms.

Language: Python - Size: 2.77 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 73 - Forks: 8

renswickd/document-parser-collection

This is a collection of various document parsers and hands-on to construct structured data for your RAG applications.

Language: Python - Size: 97.7 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 0

genieincodebottle/parsemypdf

Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.

Language: Python - Size: 3.01 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 111 - Forks: 27

mittapallynitin/PodcastAI

Podcast AI backend built with FastAPI, powered by Mistral for LLM summarization, and MCP.

Language: Python - Size: 10.7 KB - Last synced at: about 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

lavvsharma/py_mistral_helper

A Python helper for extracting text from PDFs and images using Mistral OCR

Language: Python - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0