GitHub topics: ocr-python
mathstava/Jk_tech-User-Doc-Management
User and Document Management is a robust NestJS backend solution designed for efficient user and document handling. This repository includes features like JWT authentication, PostgreSQL integration, and comprehensive testing to ensure reliability. 🐙📄
Language: TypeScript - Size: 166 KB - Last synced at: about 7 hours ago - Pushed at: about 8 hours ago - Stars: 0 - Forks: 0

TechyCSR/AdvAITelegramBot
Telegram Advanced AI ChatBot: GPT-4.1, Qwen-3, DeepSeek-R1, Dall-E3, Flux, Flux-Pro, Dall-E Model, OCR and Google Voice2Text.
Language: Python - Size: 7.82 MB - Last synced at: about 8 hours ago - Pushed at: about 9 hours ago - Stars: 9 - Forks: 2

Jayakrishnan-mk/Jk_tech-User-Doc-Management
User-Document-Management System
Language: TypeScript - Size: 156 KB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

TRZMIELLL/TourismusGardeshgari-Card-Scanner
A powerful OCR tool for extracting information from Tourismus Gardeshgari bank cards. Extracts card numbers, expiry dates, CVV codes, and Sheba numbers with high accuracy using advanced image processing techniques.
Language: Python - Size: 293 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

breezedeus/CnOCR Fork of diaomin/crnn-mxnet-chinese-text-recognition
CnOCR: Awesome Chinese/English OCR Python toolkit based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [A Chinese/English OCR Python package based on PyTorch/MXNet.]
Language: Python - Size: 17.4 MB - Last synced at: 3 days ago - Pushed at: 6 months ago - Stars: 3,557 - Forks: 522

genieincodebottle/parsemypdf
A collection of PDF parsing libraries, such as the AI-based docling, claude, openai, llama-vision, and unstructured-io, as well as pdfminer, pymupdf, pdfplumber, etc., for efficient snapshot, text, table, and metadata extraction.
Language: Python - Size: 2.75 MB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 82 - Forks: 23

Duongbe/Read-electricity-meter
Major course assignment for the TPTM&NNTM module - Class CNTT 15-02 - Faculty of Information Technology - Dai Nam University
Language: C++ - Size: 18.4 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

onk2cell/ocr_fast_api
Made with ❤️ by O Game
Language: Python - Size: 281 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

voun7/VidSubX
A program for extracting hard-coded (burned-in) subtitles from a video and generating an external subtitle file.
Language: Python - Size: 309 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 24 - Forks: 4

jWinman91/AI-OCR-Frontend
An AI-powered but model-agnostic OCR (optical character recognition) tool (frontend)
Language: Python - Size: 43 KB - Last synced at: 5 days ago - Pushed at: 8 days ago - Stars: 1 - Forks: 1

oidlabs-com/Lexoid
Multimodal document parser for high quality data understanding and extraction
Language: Python - Size: 46.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 62 - Forks: 8

hiroi-sora/Umi-OCR
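Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, excluding watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multilingual libraries.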
Language: Python - Size: 268 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 34,281 - Forks: 3,421

jWinman91/AI-OCR
An AI-powered but model-agnostic OCR (optical character recognition) tool
Language: Python - Size: 79.1 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 0

synth-studio/bombie-bot
Telegram BOMBIE BOT of Catizen Ecosystem.
Language: Python - Size: 984 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 0 - Forks: 0

CatchTheTornado/text-extract-api
Document (PDF, Word, PPTX ...) extraction and parsing API using state-of-the-art OCRs + Ollama-supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown.
Language: Python - Size: 5.07 MB - Last synced at: 9 days ago - Pushed at: about 1 month ago - Stars: 2,598 - Forks: 213

zmandyhe/pdf-to-csv
Python scripts to convert PDF files to text or csv files.
Language: Python - Size: 509 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

Sakibalam03/resume-scanner
🔍 AI-powered resume scanner that ranks candidates by semantic similarity to job descriptions. Supports PDF/DOCX/images with OCR fallback and sentence transformer embeddings for intelligent matching beyond keywords.
Language: Python - Size: 388 KB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

ankandrew/fast-plate-ocr
Lightweight & fast OCR models for license plate text recognition.
Language: Python - Size: 267 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 163 - Forks: 29

riccardogiorato/together-ai-vision-examples
Together AI SDK Vision and OCR examples in TypeScript and Python
Language: Python - Size: 25.4 KB - Last synced at: 9 days ago - Pushed at: 15 days ago - Stars: 0 - Forks: 0

Rishi-Solanki07/GEM-Tender-Document-Downloader
GEM-Tender-Document-Downloader makes it easy to get tender documents with one click. Just paste the reference IDs; it handles the captcha, downloads the files, and gives a full report in minutes.
Language: Jupyter Notebook - Size: 738 KB - Last synced at: 5 days ago - Pushed at: 18 days ago - Stars: 0 - Forks: 0

Kuju29/TextPhantomOCR_Overlay
🧠 A Chrome extension that performs OCR on images and overlays translated text in real-time — perfect for manga, webtoons, and image-based content.
Language: JavaScript - Size: 153 KB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 1 - Forks: 0

hiroi-sora/Umi-OCR_v2
An ending and a new beginning
Language: QML - Size: 292 MB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 943 - Forks: 80

shibing624/imgocr
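Python3 package for Chinese/English OCR, with the paddleocr-v4 ONNX model (~14MB). Inference is based on the ppocr-v4-onnx model, enabling accurate, millisecond-level OCR prediction on CPU; Chinese/English OCR in general scenarios reaches open-source SOTA.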
Language: Python - Size: 27.6 MB - Last synced at: 6 days ago - Pushed at: 5 months ago - Stars: 82 - Forks: 11

fabriziosalmi/pdf-ocr
Converts scanned PDF documents to multiple formats using Optical Character Recognition
Language: HTML - Size: 28.8 MB - Last synced at: 23 days ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

wihanga-dilantha/Flowchart-Generate-using-sinhala-AL-IT-questions
This project takes an image of a Sinhala A/L Information Technology (IT) flowchart question, translates it to English, uses a fine-tuned GPT-2 model to understand the logic, and then creates a visual flowchart using Graphviz. It includes OCR, Google Translate, and a user-friendly Streamlit interface.
Language: Jupyter Notebook - Size: 18.6 KB - Last synced at: 26 days ago - Pushed at: 26 days ago - Stars: 0 - Forks: 0

rdantassilva/pdf2ocr
A CLI tool to apply OCR on PDF files and export to multiple formats
Language: Python - Size: 1.28 MB - Last synced at: 17 days ago - Pushed at: 17 days ago - Stars: 1 - Forks: 0

KaranVishwakarma-1807/MNIST-CNN-Digit-Recognition
Convolutional Neural Network (CNN) for handwritten digit recognition using the MNIST dataset with TensorFlow/Keras — a simple Optical Character Recognition (OCR) demo.
Language: Python - Size: 6.84 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

gnana70/tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize Tamil text in natural-scene images with high accuracy.
Language: Python - Size: 820 MB - Last synced at: 10 days ago - Pushed at: 3 months ago - Stars: 64 - Forks: 11

StabRise/ScaleDP
ScaleDP is an Open-Source extension of Apache Spark for Document Processing
Language: Python - Size: 8.16 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11 - Forks: 0

JustForCodin/invoice-parser
InvoiceParser is an application to scan your invoices and store the data in your account. It is powered by technologies such as the YOLOv8 neural network and OCR.
Language: Python - Size: 1.13 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

mixpeek/top-ocr-libraries
Most popular open source OCR libraries listed by accuracy and speed
Size: 4.88 KB - Last synced at: 1 day ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

abstra-app/template-quote-proposal
Quote Proposal Workflow with AI automatic quotations based on a proposal + PDF Generation with quotation + Email notification to the customer.
Language: Python - Size: 977 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 1

pilarcode/receipt-ocr
Named entity recognition (NER). Extraction of features from images of receipts with different formats. #NER #OCR 🛒🏷️
Language: Jupyter Notebook - Size: 7.63 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

longlivedayo/Kanji-Database-for-Isaac-Awokoya
Using Machine Learning with Japanese Kanji
Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

pk5ls20/EasyPaddleOCR
A simple package for PaddleOCR on CPU and GPU using PyTorch
Language: Python - Size: 23.7 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 12 - Forks: 1

mariam-khediri/PixOCR-mini
PixOCR Mini Project is an end-to-end OCR pipeline built using Python and Tesseract to extract text from diverse document types. It explores preprocessing techniques to improve recognition accuracy on real-world scanned and colored images.
Language: Jupyter Notebook - Size: 559 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

sergiocorreia/quipucamayoc
dev repo for article
Language: Python - Size: 30.3 MB - Last synced at: 27 days ago - Pushed at: about 2 years ago - Stars: 29 - Forks: 5

log-ai-n/Screw-Snail-Mail
A simple tool for capturing, analyzing, and organizing physical mail using computer vision and AI.
Language: Python - Size: 51.8 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

diamond-cz/hishot
A screen capture (screenshot & OCR & translate) tool built with Python + PyQt5
Language: Python - Size: 20.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

bentoml/BentoOCR
Turn any OCR model into an online inference API endpoint 🚀 🌖
Language: Python - Size: 2.83 MB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 55 - Forks: 4

knochenhans/ocrreader2
OCR GUI based on Python and Qt6 designed to prepare and OCR images with complex layouts, mainly for Tesseract OCR.
Language: Python - Size: 482 KB - Last synced at: 9 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Magken/Leetcode_Solver
LeetCode Screenshot OCR Solver
Language: Jupyter Notebook - Size: 7.81 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Atul-vaibhav/OCR-Extraction-Using-Python
Extract text from images and PDFs using Python and store it in JSON format. Store the extracted text in a MySQL database.
Language: Python - Size: 740 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Language: Jupyter Notebook - Size: 32.4 MB - Last synced at: 21 days ago - Pushed at: over 1 year ago - Stars: 223 - Forks: 11

benjiden-dev/PDF2OCR
A Python script that monitors a folder and uses the Google Gemini API to rename PDF files placed inside
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Language: Jupyter Notebook - Size: 16.8 MB - Last synced at: 2 months ago - Pushed at: almost 3 years ago - Stars: 272 - Forks: 53

moheladwy/OCR4Linux
OCR Script Tool for Extracting Text from Screenshots (images) using only Bash and Python scripts
Language: Python - Size: 38.1 KB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 1

FREDERICO23/docling_ocr
A powerful Python package for extracting text from images and documents using the advanced, LLM-based SmolDocling-256M-preview model.
Language: Python - Size: 26.4 KB - Last synced at: 17 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 1

moi15moi/VideoSubOCR
OCR automation for VideoSubFinder
Language: Python - Size: 49.8 KB - Last synced at: 6 days ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 4

rahulsamant37/Aahdar-Card-Info-Extractor
This application extracts Aadhaar card information, including the Aadhaar number, name, date of birth, and gender, from uploaded images using OCR, with image preprocessing for improved accuracy and robust error handling.
Language: Python - Size: 24.4 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

ZephyrusBlaze/StudyBuddy-AI
StudyBuddy is an AI-powered web app that helps students summarize notes, generate practice questions, and get answers to specific study material queries. It supports PDFs, images, and text files, making learning more efficient and interactive.
Language: HTML - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

hanifabd/pisahkan-ktp
Python Package for Information Extraction and Segmentation - Indonesian KTP Segmentation - Indonesian ID Card - Information Segmentation
Language: Python - Size: 534 KB - Last synced at: 24 days ago - Pushed at: 4 months ago - Stars: 6 - Forks: 1

StabRise/ScaleDP-Tutorials
Tutorials for ScaleDP library. ScaleDP is an Open-Source Library for Processing Documents in Apache Spark.
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

Fahazavana/ScreenShot_OCR
Simple Python GUI app to convert screenshots to text (OCR)
Language: Python - Size: 1.95 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

shiv1810/Bill-Scanner-OCR
Basic Image-To-Text model that is further optimized to give/autofill the information extracted from the image
Language: Python - Size: 26.4 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

carlosacchi/captiocr
CaptiOCR - A real-time screen text extraction tool using Tesseract OCR. Capture, recognize, and log on-screen text dynamically. Future updates will include on-demand language installation, resizable selection areas, and live text overlays.
Language: Python - Size: 685 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

AltayYuzeir/Pdf2Docx-PaddleOCR-UI
📚 Pdf to Docx Converter with PaddleX & PaddleOCR UI
Language: Python - Size: 574 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 2 - Forks: 0

mlodyjesienin/Computational-Methods-2024
Solutions and projects for Computational Methods in Science & Technology course, covering many topics.
Language: Jupyter Notebook - Size: 192 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

amajji/Crop-and-OCR-documents-and-deployment-using-FastAPI-and-DOCKER
Crop and OCR documents and deployment using FastAPI and DOCKER
Language: Jupyter Notebook - Size: 22.3 MB - Last synced at: 2 months ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 0

goldenryu2000/Discord-OCR-Bot
This is an OCR Bot for Discord made using OpenCV and Pytesseract
Language: Python - Size: 703 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 9 - Forks: 5

jdhao/anti-ocr
A tool to generate text images that are hard for OCR engines to recognize and understand.
Language: Python - Size: 1000 Bytes - Last synced at: 6 days ago - Pushed at: about 5 years ago - Stars: 9 - Forks: 2

LATIS-DocumentAI-Team/DocumentAI-std
DocumentAI-std is a Python library designed to facilitate and standardize document analysis and processing tasks. It offers functionality for handling document elements, performing optical character recognition (OCR), and managing document datasets.
Language: Python - Size: 350 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

xtekky/zefoy-captcha-solver
Zefoy OCR captcha solver | 99% accurate
Language: Python - Size: 27.3 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 33 - Forks: 8

FardinHash/EasyOCR-based-Automatic-Bangla-License-Plate-Recognition
EasyOCR is an optical character recognition package built on PyTorch. It makes it easy to extract text from images and to scan documents. Because it extracts text, it applies readily to license plate number recognition. License plates around the world use characters from several languages, and Bangla is one of them. Here EasyOCR is applied to Bangla-character license plate recognition.
Language: Jupyter Notebook - Size: 435 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 14 - Forks: 3

FurkanKhann/ScreenAi
An AI-powered clipboard tool that fetches responses from models like Gemini instantly when text is copied. Plans include integrating OCR for handling non-copiable text and enhancing the UI for a seamless experience.
Language: Python - Size: 4.88 KB - Last synced at: 2 months ago - Pushed at: 4 months ago - Stars: 1 - Forks: 0

kartikmehta8/basic-surya-ocr
Basic Implementation of Surya OCR [EN]
Language: Python - Size: 2.24 MB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 6 - Forks: 0

abizovnuralem/ocr
OCR Microservice Setup Guide for Hackernoon
Language: Python - Size: 21.5 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 5 - Forks: 3

wyattferguson/pokerstars-tempest-bot
A bot that plays Tempest Poker on Poker Stars.
Language: Python - Size: 12.3 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 1

MMuflih-1/road-monitor-system
Road Monitor System is a powerful system designed to detect trucks and 18-wheelers, capturing license plates accurately and efficiently. Using cutting-edge computer vision and machine learning, it improves vehicle tracking and enhances road management. Perfect for traffic analysis, logistics, and intelligent transportation systems.
Language: Python - Size: 8.43 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

RajasekharRapaka/Gen-AI-App-for-Visually-Impaired
Building AI Powered Solution for Assisting Visually Impaired Individuals
Language: Python - Size: 476 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

VerisimilitudeX/ocr_pdf2txt
Use Optical Character Recognition technology to convert scanned PDFs into TXT files locally.
Language: Python - Size: 525 KB - Last synced at: 11 days ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ENGINEER-MUHAMMAD-SHAHZAIB/PDF2TXT-OCR
PDF2TXT-OCR is a powerful tool that adds an OCR text layer to scanned PDFs, making them searchable and editable. It supports multiple languages and ensures high-quality, searchable PDF/A output while preserving original image resolution.
Language: Python - Size: 5.38 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

jschmidaguerre/OCR-LLM-Document-Processing-Application
This project is a web application that allows users to upload documents and process them using AWS Textract for Optical Character Recognition (OCR). Additionally, it uses a Large Language Model (LLM) to improve the accuracy and processing of the extracted information, providing key data in clean and structured JSON format.
Language: Python - Size: 170 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 1

Y1D1R/PyFacture
PyFacture is a Python project designed to automate expense management from receipts. The application utilizes image processing techniques and Optical Character Recognition (OCR) using Tesseract and Llama3.2-vision to extract relevant information from a photo of a receipt.
Language: Python - Size: 817 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

kylekce/Comic-Lens
Comic Lens is an open-source desktop application designed for readers and translators alike, offering a seamless solution for manga, comic, and webcomic translation. This tool enables users to translate characters or text with precision and ease.
Language: Python - Size: 57.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

junalmeida/homeassistant-addons
Home Assistant add-ons, home to the Utility Meter Parser MQTT add-on.
Size: 16.9 MB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 7 - Forks: 0

Zreechxnn/servo-controller-OCR
Servo Controller-OCR integrates computer vision, OCR, and Arduino to control a servo motor based on text detection from a webcam. It uses Python for real-time image processing and Tesseract OCR for text recognition, combined with Arduino to handle servo motor operations. Ideal for automation projects requiring text-based triggers.
Language: Python - Size: 43 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0

prys0000/archives-handwriting-text-extract-project
Project files, scripts, configurations, and workflow publications for the Archives-Textract Test Project
Language: Python - Size: 61.3 MB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

boshyxd/ResumeOCR
Python tool that converts multiple resume images to searchable text files using OCR technology
Language: Python - Size: 5.59 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

kshrugalj/Lex-Med
This is for my LexMed project that I had done.
Language: Python - Size: 88.5 MB - Last synced at: 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

minnukota381/flask-ocr-app
A web application that allows users to upload an image and convert it to text using Optical Character Recognition (OCR) technology. This application supports user authentication and provides a user-friendly interface for image uploads and text extraction.
Language: Python - Size: 52.7 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 8 - Forks: 0

AlexBandurin/Menu_Reader
This is a web application that converts restaurant menus into text using OCR. That text is then sent through a Machine Learning model to output a list of menu items using classification and NLP.
Language: Jupyter Notebook - Size: 9.37 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 8 - Forks: 1

douyacun/ocr-to-docx
Language: Python - Size: 50.8 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 10 - Forks: 1

anjali76Codes/DocuLens---Automatic_Document_Verification
Automatic Document Verification System: A robust MERN stack-based application that automates document verification by allowing users to upload Aadhar cards and other documents for text extraction and Face API matching. Admins can approve or reject verifications, with mismatches flagged and users notified of results.
Language: JavaScript - Size: 16.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 2

pranav-0309/OCR_model_dc
OCR model to extract a primary and a secondary ID, for each image-insurance type pair.
Language: Jupyter Notebook - Size: 2.55 MB - Last synced at: 2 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

prathamesh-patil-5090/Image_Recognition
An image recognition project that leverages deep learning techniques to classify and analyze images. The model is built using Python and TensorFlow/Keras, with a focus on recognizing and categorizing objects from various image datasets.
Language: HTML - Size: 3 MB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

julicq/nexus-ocr
A CLI tool and WebApp to do OCR and extract text from a given scanned document image.
Language: Python - Size: 6.46 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

hiroshil/chromiumLensOCR_Python Fork of dimdenGD/chrome-lens-ocr
Library to use Google Lens OCR for free, via API used in Chromium. AI-converted Python code, edited by me.
Language: Python - Size: 318 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

khaouitiabdelhakim/ArabicOCR-Python-Tutorial
This project uses the ArabicOcr package to convert Arabic text in images to editable text using OCR techniques.
Language: Python - Size: 611 KB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 2 - Forks: 0

XenosWarlocks/Image_Text_Extractor
A Python-based tool for batch processing and extracting text from images using OCR (Tesseract). The extracted text is cleaned by removing unwanted terms, and potential names are identified and formatted. Results are saved in a structured text file for easy reference. Ideal for automating data extraction and preprocessing tasks.
Language: Python - Size: 48.8 KB - Last synced at: 19 days ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

Anant2003jain/TextExtractify
TextExtractify is an AI-powered tool that extracts text from images and PDFs using both Azure OCR and EasyOCR. It offers features like multi-image upload, text entity extraction, and .docx export for premium users. Designed to streamline document processing with fast, accurate text extraction.
Language: Python - Size: 85.4 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

r-limpz/archival-system
ARDS (Archival Record Digitization System) is a web-based application that uses OCR to convert physical academic records into editable, searchable digital formats. It addresses challenges such as diverse layouts, varying text quality, and security, enhancing record management efficiency.
Language: JavaScript - Size: 17.6 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

rvats20/llama-OCR
Language: Jupyter Notebook - Size: 2.62 MB - Last synced at: 2 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

MohammedNasserAhmed/arabic-pdf-chat
Arabic Chat with PDF is a user-friendly application that lets you interact with Arabic PDF documents. Powered by advanced language models, OCR, and vector search, it allows you to upload PDFs, ask questions, and receive accurate Arabic responses 🚀
Language: Python - Size: 84 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

HamidRezaAttar/Persian-OCR-Streamlit
Persian OCR allows users to scan documents and extract text from scanned images.
Language: Python - Size: 2.77 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 11 - Forks: 4

fescofesco/MtG-OCR
Identify set and name information from Magic: The Gathering card images
Language: Python - Size: 29.5 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 1

andshrew/PlayStation-Voucher-Prices
This repository contains an example for scraping pricing data for a product from an online retailer that displays their product prices within a Base64 encoded image.
Language: Python - Size: 463 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

Adeyemi0/Python-OCR
This code extracts text from images
Language: Python - Size: 44.9 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

ajxv/pyocr-flask
PDF OCR text extraction using Python
Language: HTML - Size: 7.81 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

FeelsBotMan/KoreanText-Recognition
This repository is a project that compares the Korean (Hangul) text recognition accuracy of several open-source OCR engines.
Language: Python - Size: 8.79 KB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0
