GitHub topics: ocr-python

Repositories

rampage445/OCR-Bangla

A Python script that uses Gemini OCR to extract text from newspaper screenshots, generates concise summaries of the content, and syncs with Google Drive.

Language: Python - Size: 9.65 MB - Last synced at: about 23 hours ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

tarrantwrong366/OCR-Document-parser

📝 Streamline document analysis by extracting key fields from PAN cards, resumes, and handwritten notes using Tesseract OCR and a simple Streamlit interface.

Language: Python - Size: 21.5 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

TRZMIELLL/TourismusGardeshgari-Card-Scanner

A powerful OCR tool for extracting information from Tourismus Gardeshgari bank cards. Extracts card numbers, expiry dates, CVV codes, and Sheba numbers with high accuracy using advanced image processing techniques.

Language: Python - Size: 293 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

hiroi-sora/Umi-OCR

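OCR software, free and offline. Open-source, free offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.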

Language: Python - Size: 268 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 39,748 - Forks: 3,936

Hkwln/nnupytorch

A collection of NN/ML models for getting to know PyTorch

Language: Jupyter Notebook - Size: 57.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 1 - Forks: 0

Sankeerth4026/toxicSafeText

Prototype app that detects toxic text on screen using OCR and blurs it in real time. Experimental and not fully working — improvements welcome

Language: Python - Size: 1.79 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

duonghieu7104/TikTok-Video-Scan

🤖 TikTok video analyzer using AI: Speech transcription, OCR, object detection, and AI summaries. Local demos + Docker data lake house.

Language: Python - Size: 17.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

herbicider/HayatPrecheck

A screen OCR tool for verifying data entry in pharmacy software

Language: Python - Size: 117 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

Sabastincruzz/Tools_DeepSeekOCR

🖥️ Deploy DeepSeek-OCR for Optical Character Recognition directly from screen captures on Windows, enabling efficient text extraction from images.

Language: Python - Size: 1.31 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

chaffybird56/morpho-plate

lightweight ALPR pipeline; designed for real‑time use on roadway footage

Language: Python - Size: 67.4 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

Y2marcos/DeepSeek-OCR-Studio

This tool helps you convert documents to Markdown, extract raw text, and locate specific content with bounding boxes. It takes about 20 seconds for Markdown conversion and about 3 seconds for a locate task. Check the info at the bottom of the page for more information.

Language: Python - Size: 107 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

StabRise/ScaleDP-Tutorials

Tutorials for ScaleDP library. ScaleDP is an Open-Source Library for Processing Documents in Apache Spark.

Language: Jupyter Notebook - Size: 19 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 5 - Forks: 0

CatchTheTornado/text-extract-api

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Language: Python - Size: 5.06 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 2,916 - Forks: 247

StabRise/ScaleDP

ScaleDP is an Open-Source extension of Apache Spark for Document Processing

Language: Python - Size: 16 MB - Last synced at: 8 days ago - Pushed at: 10 days ago - Stars: 16 - Forks: 2

mwasifanwar/DocuMind-AI

Advanced OCR and document understanding system that extracts, classifies, and analyzes complex documents. Handles tables, forms, invoices, and contracts using transformer-based models and layout understanding.

Language: Python - Size: 29.3 KB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 0 - Forks: 0

voun7/VidSubX

A program for extracting hard-coded (burned-in) subtitles from a video and generating an external subtitle file.

Language: Python - Size: 309 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 45 - Forks: 9

open-source-modelling/SFCR_using_Mistral

Transform PDFs into DataFrames using Mistral OCR and Python.

Language: Jupyter Notebook - Size: 97.6 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 3 - Forks: 0

oidlabs-com/Lexoid

Multimodal document parser for high quality data understanding and extraction

Language: Python - Size: 48.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 84 - Forks: 10

RepoRookies/Rookie-OCR

Computer Vision course project which aims to build an OCR pipeline from the ground up, focusing on understanding the image preprocessing, text segmentation, and deep learning–based recognition processes.

Language: Jupyter Notebook - Size: 23.4 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

zhiweiiii/fapiao-ocr-excel

Automatically recognizes invoice content using OCR and exports it to Excel. (Automatically recognizes images and PDF files.)

Language: Python - Size: 164 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 24 - Forks: 0

sethupavan12/Markdownify

Convert documents, images to high-quality Markdown using Vision LLMs. Built for RAG ingestion pipelines.

Language: Python - Size: 12.8 MB - Last synced at: 9 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

bentoml/BentoOCR

Turn any OCR models into online inference API endpoint 🚀 🌖

Language: Python - Size: 2.83 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 56 - Forks: 4

shibing624/imgocr

Python3 package for Chinese/English OCR, using the paddleocr-v5 ONNX model (~20MB), with ultra-fast inference speed. Inference is based on the ppocr-v5-onnx model; open-source SOTA for Chinese/English OCR.

Language: Python - Size: 49.2 MB - Last synced at: 20 days ago - Pushed at: 4 months ago - Stars: 107 - Forks: 15

fapulito/vercel_textract

Deploy to Vercel - Python Client for AWS Textract | OCR SaaS with Development Roadmap

Language: HTML - Size: 117 KB - Last synced at: 24 days ago - Pushed at: 24 days ago - Stars: 0 - Forks: 0

dwsilvar/pehape

Language: Python - Size: 186 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

reuAC/Tools_DeepSeekOCR

A Windows-based screenshot OCR utility powered by DeepSeek-OCR. This tool allows users to quickly capture screen regions and perform high-accuracy Optical Character Recognition (OCR) directly on the captured image, leveraging the powerful DeepSeek-OCR model. It supports local model deployment and features real-time model output streaming.

Language: Python - Size: 12.7 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

marviniciuz/flamel

Language: Python - Size: 23.4 KB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

knochenhans/OCRReader2

OCR GUI based on Python and Qt6 designed to prepare and OCR images with complex layouts, mainly for Tesseract OCR.

Language: Python - Size: 65.8 MB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 0 - Forks: 0

udit-asopa/vision-text-extractor Fork of t-redactyl/ocr-llm-agent

Extract text from images using multiple AI providers - local SmolVLM, Ollama LLaVA, or OpenAI GPT-4o

Language: Python - Size: 4.75 MB - Last synced at: 29 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

Kuju29/TextPhantomOCR_Overlay

🧠 A Chrome extension that instantly translates text from images right inside your browser! Perfect for manga, webtoons, and any image-based content.

Language: JavaScript - Size: 232 KB - Last synced at: 30 days ago - Pushed at: about 1 month ago - Stars: 4 - Forks: 0

marviniciuz/homero_ocr

Language: Python - Size: 30.3 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

pythonicshariful/phone-number-extractor

A Python script that extracts phone numbers from images using Tesseract OCR and Regex. Automatically organizes processed images into success and failed folders, and saves results to a CSV file.

Language: Python - Size: 84 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

ankandrew/fast-plate-ocr

Lightweight & fast OCR models for license plate text recognition.

Language: Python - Size: 267 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 289 - Forks: 43

breezedeus/CnOCR Fork of diaomin/crnn-mxnet-chinese-text-recognition

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [A Chinese/English OCR Python package based on PyTorch/MXNet.]

Language: Python - Size: 17.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 3,653 - Forks: 528

xtekky/zefoy-captcha-solver

Zefoy OCR captcha solver | 99% accurate

Language: Python - Size: 27.3 KB - Last synced at: about 1 month ago - Pushed at: over 3 years ago - Stars: 35 - Forks: 9

k8731/Rise-Of-Kingdoms-Alliance-Contribution-Tracker

Python tool to extract and organize Rise of Kingdoms player stats from screenshots using OCR.

Language: Python - Size: 141 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sarahlorenzen/Outamation-Extern

RAG & OCR Document Parser AI / Automation Externship

Language: Jupyter Notebook - Size: 936 KB - Last synced at: 22 days ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Leoneix/ocrProj

A project to make machine-readable PDFs, along with other upcoming features.

Language: Python - Size: 15.6 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

shreyaj661/shreyaj661.github.io

Academic portfolio and CV hosted on GitHub Pages. Includes education, research experience, publications, and projects.

Language: HTML - Size: 11.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Daniel-xue/paddle-LPR

implementation of license plate recognition with PaddleOCR.

Language: Python - Size: 5.66 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

fabriziosalmi/pdf-ocr

Converts scanned PDF documents to multiple formats using Optical Character Recognition

Language: HTML - Size: 28.8 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 6 - Forks: 0

hanifabd/pisahkan-ktp

Python package for information extraction and segmentation of Indonesian ID cards (KTP).

Language: Python - Size: 534 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 8 - Forks: 1

spartan3661/Gamegrab

OCR app for video games

Language: Python - Size: 224 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

BatuhanYilmaz26/RAG-With-Routing

Streamlit app for retrieval-augmented generation (RAG) with multi-database routing, OCR ingestion, and a resilient web fallback.

Language: Python - Size: 3.65 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

moheladwy/OCR4Linux

OCR script tool for extracting text from screenshots (images) using only Bash and Python scripts

Language: Shell - Size: 43.9 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 26 - Forks: 4

gnana70/tamil_ocr

OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes

Language: Python - Size: 820 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 71 - Forks: 13

AltayYuzeir/PaddleOCR-UI

📚 A high-quality tool to convert PDF to Docx with PaddleX & PaddleOCR with UI

Language: Python - Size: 590 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 4 - Forks: 0

DynamiteDan/wordbomb-cheat

WordBomb cheat that is written entirely in Python using Tesseract OCR.

Language: Python - Size: 105 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

farhanfahim00/library

A Python desktop app for organizing personal book libraries with OCR support, JSON-based storage, and a full GUI. Built with Tkinter, PyOCR, and Unittest.

Language: Python - Size: 2.78 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Takk8IS/TheArchivistLens

This system implements the complete Reinert Method of Descending Hierarchical Classification (DHC), offering all IRaMuTeQ functionalities

Language: Python - Size: 117 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

PSPDFKit/nutrient-dws-client-python

Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion

Language: Python - Size: 3.02 MB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 54 - Forks: 0

carlosacchi/captiocr

CaptiOCR - A real-time screen text extraction tool using Tesseract OCR. Capture, recognize, and log on-screen text dynamically. Future updates will include on-demand language installation, resizable selection areas, and live text overlays.

Language: Python - Size: 734 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 1

LATIS-DocumentAI-Team/DocumentAI-std

DocumentAI-std is a Python library designed to facilitate and standardize document analysis and processing tasks. It offers functionality for handling document elements, performing optical character recognition (OCR), and managing document datasets.

Language: Python - Size: 359 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

sergiocorreia/quipucamayoc

dev repo for article

Language: Python - Size: 30.3 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 30 - Forks: 5

genieincodebottle/parsemypdf

Collection of PDF parsing libraries such as AI-based docling, claude, openai, gemini, meta's llama-vision, and unstructured-io, plus pdfminer, pymupdf, pdfplumber, etc., for efficient snapshot, text, table, and metadata extraction.

Language: Python - Size: 3.01 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 111 - Forks: 27

wyattferguson/pokerstars-tempest-bot

A bot that plays Tempest Poker on Poker Stars.

Language: Python - Size: 7.26 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 1

TimInTech/pdf-text-duplicate-checker

PDF Duplicate Detector & Mover (Text + Image Hashing)

Language: Python - Size: 98.4 MB - Last synced at: about 1 month ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

cjspd-oly/Achievement-Auto-Marker-for-Paimon.moe

Automatically mark your Genshin Impact achievements on the website paimon.moe using just a recorded video. *Join Discord for help*.

Language: Python - Size: 5.64 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

FREDERICO23/docling_ocr

A powerful Python package for extracting text from images and documents using the advanced LLM-based SmolDocling-256M-preview model.

Language: Python - Size: 26.4 KB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 11 - Forks: 1

bnvulpe/code-extractor

Transforming images into code at a click. Upload a photo or screenshot and copy the code to your script in seconds!

Language: HTML - Size: 299 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

montedev0516/mvp-MDH

AI based MDH - Multi-Tenant Dispatch Hub SaaS. Login credentials (user: Taras, pwd: mdhtarasbliva)

Language: Python - Size: 1.41 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

SeekAI-786/Smart-Glasses-For-Blind-People

Developed a prototype of smart glasses aimed at supporting blind and visually impaired individuals through real time object detection and text reading capabilities.

Language: Python - Size: 2.06 MB - Last synced at: 3 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/Multimodal-OCR

Vision Language Model: tailored for tasks that involve [messy] optical character recognition (OCR), image-to-text conversion, and math problem solving with LaTeX formatting.

Language: Python - Size: 12.5 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 5 - Forks: 0

Jayakrishnan-mk/User-Doc-Management-NEST-JS

User-Document-Management System - Modular NestJS Backend. This is a production-ready NestJS backend application for user and document management.

Language: TypeScript - Size: 163 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

Hassan1222003/telegram-ocr-bot

Extract text from images using the Telegram OCR Bot. Enjoy multi-language support, inline feedback, and Gemini AI for enhanced accuracy. 🚀🤖

Language: Python - Size: 11.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

LongIsHandsome/trocr-webapp

A web application for recognising handwriting using TrOCR model

Language: CSS - Size: 53.7 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/OCR-Optical-Character-Recognition

OCR stands for optical character recognition. It is also known as an optical character reader (OCR) or text recognition.

Language: Python - Size: 496 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 6 - Forks: 0

TechyCSR/AdvAITelegramBot

Telegram Advance AI ChatBot: GPT-4.1, Qwen-3, DeepSeek-R1, Dall-E-3, Flux, Flux-Pro, Dall-E Model, OCR and Google Voice2Text.

Language: Python - Size: 12.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 10 - Forks: 2

iremalgul/ocr_streamlit

Text Extraction and Analysis with PDF OCR

Language: Python - Size: 445 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Yashh1524/OCR-Tool-App-using-PyTesseract

A simple and efficient Optical Character Recognition (OCR) tool built with Python and pytesseract.

Language: Python - Size: 14.6 KB - Last synced at: 3 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

sageil/ai-ocr

AI driven OCR

Language: Python - Size: 502 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

Samuela31/Sanskrit-Manuscripts-Revival-Using-Deep-Learning-Techniques

Restoring destroyed text in ancient Sanskrit manuscripts by predicting missing text using deep learning techniques. Mini project done in 3rd year of college using RoBERTa LLM, Tesseract OCR, and OpenCV.

Language: Jupyter Notebook - Size: 17.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

NekoImageLand/EasyPaddleOCR

A simple package for PaddleOCR on CPU and GPU using PyTorch

Language: Python - Size: 23.7 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 12 - Forks: 2

HamidRezaAttar/Persian-OCR-Streamlit

Persian OCR allows users to scan documents and extract text from scanned images.

Language: Python - Size: 2.77 MB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 13 - Forks: 4

lorenzobabini/Tesseract-GUI-to-create-training-data Fork of dshea89/tesseract-retraining-pipeline

Intuitive interface to create new Ground Truth training data for fine-tuning a Tesseract OCR model

Language: Python - Size: 2.7 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

TsvetanG2/Advanced-Local-OCR

Advanced Local OCR is a project inspired by the text extraction some AIs do. Instead of leaving people to pay for such services, why not publish an open-source version that protects each user's privacy? The app allows integration with LLMs via APIs.

Language: Python - Size: 101 KB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

jprmaulion/japanese-receipt-ocr-cv

OCR text detection and recognition on a Japanese store invoice using PaddleOCR

Language: Jupyter Notebook - Size: 2.93 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

snakers4/silero-ocr

Nice, clean and minimalistic OCR pipeline for Russian and English.

Size: 1000 Bytes - Last synced at: 19 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Atul-vaibhav/OCR-Extraction-Using-Python

Extract text from images and PDFs using Python and store it in JSON format. Store the extracted data in a MySQL database.

Language: Python - Size: 743 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

SYAAGalib/project_ocr

This code can do OCR

Language: Python - Size: 1.23 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 4 - Forks: 0

akayg/Podez

Podez is a smart CS project that turns handwritten or printed Python code into executable programs. Using AI, OCR, and Google Gemini, it extracts, refines, and runs code from images—all in a secure, interactive space. It’s a powerful mix of machine learning, image processing, and web tech for real-world code automation.

Language: Python - Size: 22.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

Duongbe/Read-electricity-meter

Major course assignment for the TPTM&NNTM module - Class CNTT 15-02 - Faculty of Information Technology - Dai Nam University

Language: C++ - Size: 18.4 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

onk2cell/ocr_fast_api

Made with❤️ love by O Game

Language: Python - Size: 281 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

jWinman91/AI-OCR-Frontend

An AI-powered, but model-agnostic (Optical-Character-Recognition) OCR tool (frontend)

Language: Python - Size: 43 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 1

jWinman91/AI-OCR

An AI-powered, but model-agnostic (Optical-Character-Recognition) OCR tool

Language: Python - Size: 79.1 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 2 - Forks: 0

synth-studio/bombie-bot

Telegram BOMBIE BOT of Catizen Ecosystem.

Language: Python - Size: 984 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

zmandyhe/pdf-to-csv

Python scripts to convert PDF files to text or csv files.

Language: Python - Size: 509 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

rdantassilva/pdf2ocr

A CLI tool to apply OCR on PDF files and export to multiple formats

Language: Python - Size: 1.46 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Sakibalam03/resume-scanner

🔍 AI-powered resume scanner that ranks candidates by semantic similarity to job descriptions. Supports PDF/DOCX/images with OCR fallback and sentence transformer embeddings for intelligent matching beyond keywords.

Language: Python - Size: 388 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

riccardogiorato/together-ai-vision-examples

Together AI SDK Vision and OCR examples in TypeScript and Python

Language: Python - Size: 25.4 KB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Rishi-Solanki07/GEM-Tender-Document-Downloader

GEM-Tender-Document-Downloader makes it easy to get tender documents with one click. Just paste the reference IDs; it handles the captcha, downloads the files, and gives a full report in minutes.

Language: Jupyter Notebook - Size: 738 KB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

hiroi-sora/Umi-OCR_v2

An ending and a new beginning

Language: QML - Size: 292 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 943 - Forks: 80

wihanga-dilantha/Flowchart-Generate-using-sinhala-AL-IT-questions

This project takes an image of a Sinhala A/L Information Technology (IT) flowchart question, translates it to English, uses a fine-tuned GPT-2 model to understand the logic, and then creates a visual flowchart using Graphviz. It includes OCR, Google Translate, and a user-friendly Streamlit interface.

Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

Related Keywords

ocr-python (422), ocr (222), python (142), ocr-recognition (139), tesseract-ocr (65), ocr-text-reader (55), python3 (52), opencv (47), tesseract (39), image-processing (33), computer-vision (31), opencv-python (29), machine-learning (29), pytesseract (28), optical-character-recognition (20), pdf (18), easyocr (17), ai (15), streamlit (14), flask (14), paddleocr (13), deep-learning (13), ocr-engine (12), pytorch (11), text-recognition (10), docker (10), object-detection (10), pillow (9), artificial-intelligence (9), fastapi (9), django (9), nlp (8), data-science (8), llm (8), tkinter-gui (7), image (7), pytesseract-ocr (7), cv2 (7), text-detection-recognition (7), natural-language-processing (7), handwritten-text-recognition (7), pandas (6), text-extraction (6), text-detection (6), yolov8 (6), pdf-converter (6), automation (6), javascript (5), ocr-service (5), pdf-document-processor (5), nlp-machine-learning (5), api (5), mysql-database (5), license-plate-recognition (5), converter (5), html (5), openai (5), image-recognition (5), tensorflow (4), table-extraction (4), pymupdf (4), flask-application (4), regex (4), snipping-tool (4), python-3 (4), image-to-text (4), tesseract-python (4), tts (4), sql (4), docker-compose (4), handwriting-recognition (4), tkinter-python (4), open-source (4), paddlepaddle (4), html-css-javascript (4), python-script (4), qt (4), text-processing (4), pyqt5 (4), windows (4), rag (4), nodejs (4), webapp (4), telegram (4), machine-learning-algorithms (4), telegram-bot (4), gemini-ai (4), text (3), pdf-ocr-extraction (3), django-project (3), neural-network (3), jupyter-notebook (3), poppler (3), translation (3), ml (3), django-rest-framework (3), screenshot (3), llms (3), azure (3), persian-ocr (3)