GitHub topics: ocr-engine
TsvetanG2/Advanced-Local-OCR
Advanced Local OCR is a project inspired by the text extraction that some AI services perform. Instead of leaving people to pay for such services, it offers an open-source version that preserves each user's privacy. The app allows integration with LLMs via APIs.
Language: Python - Size: 0 Bytes - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

NMAC427/SwiftOCR
Fast and simple OCR library written in Swift
Language: Swift - Size: 11.1 MB - Last synced at: about 3 hours ago - Pushed at: over 4 years ago - Stars: 4,633 - Forks: 479

tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Language: C++ - Size: 51.1 MB - Last synced at: 9 days ago - Pushed at: 29 days ago - Stars: 67,675 - Forks: 9,966

kaelzhang/penteract-ocr
⭐️ The native node.js bindings to the Tesseract OCR project.
Language: C++ - Size: 1.01 MB - Last synced at: 2 days ago - Pushed at: almost 7 years ago - Stars: 126 - Forks: 13

bhimrazy/receipt-ocr
Efficient OCR engine for receipt image processing using Python, FastAPI, and Tesseract
Language: Python - Size: 22.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 82 - Forks: 21

ahmetozlu/signature_extractor
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
Language: Python - Size: 3.9 MB - Last synced at: 29 days ago - Pushed at: about 2 years ago - Stars: 489 - Forks: 148

voun7/Subtitle_OCR
Using deep learning with PyTorch for a specialized subtitle text detection and recognition.
Language: Python - Size: 7.43 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

php-ocr/ocr-space-engine
Engine using the https://ocr.space API.
Language: PHP - Size: 18.6 KB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

AbdulAHAD968/TAP-MAN
A simple phone number extractor and validator using Tesseract OCR.
Language: JavaScript - Size: 35.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

ushelp/EasyOCR 📦
Java OCR recognition component (based on the Tesseract OCR engine). It automatically cleans up images and recognizes the content of CAPTCHA verification-code images in one integrated workflow.
Language: Java - Size: 629 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 617 - Forks: 245

stacksapien/react-tesseract-ocr
Tesseract OCR implementation in React JS
Language: JavaScript - Size: 46.9 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 19 - Forks: 14

MiniAiLive/.github
Size: 47.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 15 - Forks: 2

leferrad/OCReract.jl
A simple Julia wrapper for Tesseract OCR
Language: Julia - Size: 792 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 28 - Forks: 9

UB-Mannheim/mocrin
Multiple OCR-engine interface
Language: Python - Size: 1.05 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 3

adityamehra/Business-Card-Reader-BCR-
Android app that extracts the name, email, and phone number from a business card using the OCR library tess-two (a fork of Tesseract Tools for Android) and the phone's camera.
Language: Java - Size: 18.3 MB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 74 - Forks: 31

Elanchezhian2712/FastAPI-Image-to-Text-Conversion-Project
Welcome to the FastAPI Image-to-Text Conversion Project! This repository provides a FastAPI-based application designed to convert images into text using Optical Character Recognition (OCR). Users can upload images, extract text, and submit the extracted data to a database through a user-friendly web interface.
Language: Python - Size: 2.99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

php-ocr/core
Core modules and interfaces.
Language: PHP - Size: 15.6 KB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

baris-acar-dev/OCR
Language: Python - Size: 3.91 KB - Last synced at: about 9 hours ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

festivitymishra/PyraDox
PyraDox is a Python tool that helps with document digitization by extracting text and masking personal information with the help of Tesseract OCR.
Language: Python - Size: 6.43 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 14

LATIS-DocumentAI-Group/ocr-microservice
This microservice standardizes the usage of Optical Character Recognition (OCR) engines
Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

gojiplus/abbyyR 📦
R Client for the Abbyy Cloud OCR
Language: HTML - Size: 4.05 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 42 - Forks: 11

slompo/Tesseract-Example
Java Tesseract 3.4.4 Example
Language: Java - Size: 41.1 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 2

dhruv2601/Business-Card-Scanner
Engine for Optical Character Recognition to scan Business Cards locally on Android devices.
Language: C - Size: 80.3 MB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 25 - Forks: 15

geekalaa/OCRJS
Extracting characters from images using tesseract.js (OCR) in JavaScript.
Language: Hack - Size: 31.3 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

giruu/TesserXtract.AI
This Flask application empowers users to seamlessly upload image files like invoices or receipts, extract text using robust OCR technologies, and efficiently isolate key fields using precise regular expressions and multiprocessing to streamline data extraction and enhance productivity.
Language: Python - Size: 159 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lakshay1296/OCR_Conversion_JPEG2PDF
JPEG to PDF conversion using Tesseract v4 from the command line. Includes OCR'ing the JPEGs and combining the resulting PDFs into a single multi-page PDF.
Language: Roff - Size: 9.94 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

mohamedgamalmoha/Image-To-Text
This system is a RESTful API that takes an image file as input and returns the text content of the image as output. The system uses the Tesseract OCR engine to extract text from the image.
Language: Python - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jnoxro/joker
A high speed character recognition engine
Language: C++ - Size: 460 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

bityigoss/mtl-text-recognition
multi-task learning for text recognition with joint CTC-attention
Language: Python - Size: 1.91 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 111 - Forks: 37

Aadv1k/OctetOCR
Octet is an exploratory OCR or text recognition library to prepare and train upon raw data
Language: C - Size: 184 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

Eroica/greedy-ocr 📦
An OCR engine that works by finding pre-known letters in a word's image
Language: Lua - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 12 - Forks: 2

Hassi34/Optical-Character-Recognition
Vision AI service (REST API) for OCR (optical character recognition) 📷
Language: Python - Size: 34.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Artificial-Brix/BRIX-OCR
Hello, this is Brix OCR, an open-source project where we try to build an OCR engine with the help of others as well as make a custom model.
Language: Jupyter Notebook - Size: 47.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 3

elvinagam/Tesseract-OCR-AZE
Tesseract on a few examples, such as template matching for different languages (AZE)
Language: Jupyter Notebook - Size: 9.65 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

spajai/Ocr-OcrSpace
Perl Interface for Optical Character Recognition https://ocr.space/ (https://metacpan.org/pod/OCR::OcrSpace)
Language: Perl - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

batux/staff_identity_card_ocr_project
Staff Identity Card OCR Project
Language: Python - Size: 33 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 7

TungstenTransformation/Vidado
How to integrate Vidado Read OCR into Kofax Transformation's Advanced Zone Locator
Language: Visual Basic .NET - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

anselcorona/WekaOCR
Data Mining class capstone project.
Language: Java - Size: 18.9 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

php-ocr/http
Modules and interfaces for HTTP engines.
Language: PHP - Size: 9.77 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohitsroch/OCR
A basic OCR pipeline written in TensorFlow for CPU, using the EMNIST dataset
Language: Python - Size: 223 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

PresentIDco/Credit-Card-OCR
The PresentID optical character recognition API can extract data from all types of cards, including driver's licenses, national ID cards, certificates, etc.
Size: 6.84 KB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

bq-xiao/ocr-demo
OCR Demo
Language: Python - Size: 134 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

simranjeet97/GoogleVision_OCR
GoogleVision_OCR Project to Read out PDF
Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

farbod-s/Text-Recognition
Undergraduate Final Project
Size: 1.61 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

Cool-fire/Snipps
📚 📝📜 A simple Android app that converts information into digital snippets and extracts text using OCR.
Language: Java - Size: 3.35 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

itsayushthada/PGMs
Repository for tasks like Representation, Inference and Learning of Probabilistic Graphical Models.
Language: MATLAB - Size: 4.74 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

tsudnevitz/HardyBits.Ocr
.NET OCR Engine based on Tesseract 4
Language: C# - Size: 68.6 MB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Techget/Text-Recognition
Language: Python - Size: 58.9 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

nepalihackers/nepali-ocr
Optical Character Recognition framework and application for Nepali Language
Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

TheOpenDevProject/ocradjs-browser
OCRAD.js wrapped up neatly for building into your browser-based projects with Webpack
Language: JavaScript - Size: 620 KB - Last synced at: 4 months ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0

sgino209/OCR_NaturalPhotos
OCR of Natural Photos using Machine-Learning techniques (based on Python sklearn toolkit)
Language: Python - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0
