An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ocr-engine

TsvetanG2/Advanced-Local-OCR

Advanced Local OCR is a project inspired by the text extraction some AIs do. Instead of leaving people to pay for such services, why not publish an open-source version that preserves each user's privacy? The app allows integration with LLMs via APIs.

Language: Python - Size: 0 Bytes - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

NMAC427/SwiftOCR

Fast and simple OCR library written in Swift

Language: Swift - Size: 11.1 MB - Last synced at: about 3 hours ago - Pushed at: over 4 years ago - Stars: 4,633 - Forks: 479

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

Language: C++ - Size: 51.1 MB - Last synced at: 9 days ago - Pushed at: 29 days ago - Stars: 67,675 - Forks: 9,966

kaelzhang/penteract-ocr

⭐️ The native node.js bindings to the Tesseract OCR project.

Language: C++ - Size: 1.01 MB - Last synced at: 2 days ago - Pushed at: almost 7 years ago - Stars: 126 - Forks: 13

bhimrazy/receipt-ocr

Efficient OCR engine for receipt image processing using Python, FastAPI, and Tesseract

Language: Python - Size: 22.8 MB - Last synced at: about 1 month ago - Pushed at: 7 months ago - Stars: 82 - Forks: 21

ahmetozlu/signature_extractor

A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

Language: Python - Size: 3.9 MB - Last synced at: 29 days ago - Pushed at: about 2 years ago - Stars: 489 - Forks: 148

voun7/Subtitle_OCR

Using deep learning with PyTorch for specialized subtitle text detection and recognition.

Language: Python - Size: 7.43 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 2 - Forks: 0

php-ocr/ocr-space-engine

Engine using the https://ocr.space API.

Language: PHP - Size: 18.6 KB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

AbdulAHAD968/TAP-MAN

A simple phone number extractor and validator using Tesseract OCR.

Language: JavaScript - Size: 35.2 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 3 - Forks: 0

ushelp/EasyOCR 📦

Java OCR recognition component (based on the Tesseract OCR engine). It automatically handles image cleanup and recognition of CAPTCHA verification code image content as one integrated task.

Language: Java - Size: 629 KB - Last synced at: 2 months ago - Pushed at: almost 4 years ago - Stars: 617 - Forks: 245

stacksapien/react-tesseract-ocr

Tesseract OCR implementation in React JS

Language: JavaScript - Size: 46.9 KB - Last synced at: 3 months ago - Pushed at: over 5 years ago - Stars: 19 - Forks: 14

MiniAiLive/.github

Size: 47.9 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 15 - Forks: 2

leferrad/OCReract.jl

A simple Julia wrapper for Tesseract OCR

Language: Julia - Size: 792 KB - Last synced at: 3 days ago - Pushed at: 11 months ago - Stars: 28 - Forks: 9

UB-Mannheim/mocrin

Multiple OCR-engine interface

Language: Python - Size: 1.05 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 9 - Forks: 3

adityamehra/Business-Card-Reader-BCR-

Android app to extract name, email and phone from business card using OCR library tess-two (Fork of Tesseract Tools for Android) and phone's camera.

Language: Java - Size: 18.3 MB - Last synced at: 14 days ago - Pushed at: over 5 years ago - Stars: 74 - Forks: 31

Elanchezhian2712/FastAPI-Image-to-Text-Conversion-Project

Welcome to the FastAPI Image-to-Text Conversion Project! This repository provides a FastAPI-based application designed to convert images into text using Optical Character Recognition (OCR). Users can upload images, extract text, and submit the extracted data to a database through a user-friendly web interface.

Language: Python - Size: 2.99 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

php-ocr/core

Core modules and interfaces.

Language: PHP - Size: 15.6 KB - Last synced at: 24 days ago - Pushed at: over 3 years ago - Stars: 5 - Forks: 0

baris-acar-dev/OCR

Language: Python - Size: 3.91 KB - Last synced at: about 9 hours ago - Pushed at: 11 months ago - Stars: 0 - Forks: 1

festivitymishra/PyraDox

PyraDox is a Python tool that helps with document digitization by extracting text information and masking personal information with the help of Tesseract OCR.

Language: Python - Size: 6.43 MB - Last synced at: 10 months ago - Pushed at: over 4 years ago - Stars: 19 - Forks: 14

LATIS-DocumentAI-Group/ocr-microservice

This microservice standardizes the usage of Optical Character Recognition (OCR) engines

Language: Python - Size: 28.3 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

gojiplus/abbyyR 📦

R Client for the Abbyy Cloud OCR

Language: HTML - Size: 4.05 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 42 - Forks: 11

slompo/Tesseract-Example

Java Tesseract 3.4.4 Example

Language: Java - Size: 41.1 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 5 - Forks: 2

dhruv2601/Business-Card-Scanner

Engine for Optical Character Recognition to scan Business Cards locally on Android devices.

Language: C - Size: 80.3 MB - Last synced at: about 1 year ago - Pushed at: almost 8 years ago - Stars: 25 - Forks: 15

geekalaa/OCRJS

Extracting characters from an image using tesseract.js (OCR) in JavaScript.

Language: Hack - Size: 31.3 MB - Last synced at: over 1 year ago - Pushed at: about 4 years ago - Stars: 2 - Forks: 0

giruu/TesserXtract.AI

This Flask application empowers users to seamlessly upload image files like invoices or receipts, extract text using robust OCR technologies, and efficiently isolate key fields using precise regular expressions and multiprocessing to streamline data extraction and enhance productivity.

Language: Python - Size: 159 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lakshay1296/OCR_Conversion_JPEG2PDF

JPEG to PDF conversion using Tesseract v4 through cmd. Includes OCR'ing the JPEGs and combining multi-page PDFs into one.

Language: Roff - Size: 9.94 MB - Last synced at: over 1 year ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 1

mohamedgamalmoha/Image-To-Text

This system is a RESTful API that takes an image file as input and returns the text content of the image as output. The system uses the Tesseract OCR engine to extract text from the image.

Language: Python - Size: 17.6 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

jnoxro/joker

A high speed character recognition engine

Language: C++ - Size: 460 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

bityigoss/mtl-text-recognition

multi-task learning for text recognition with joint CTC-attention

Language: Python - Size: 1.91 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 111 - Forks: 37

Aadv1k/OctetOCR

Octet is an exploratory OCR or text recognition library to prepare and train upon raw data

Language: C - Size: 184 KB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 1

Eroica/greedy-ocr 📦

An OCR engine that works by finding pre-known letters in a word's image

Language: Lua - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 6 years ago - Stars: 12 - Forks: 2

Hassi34/Optical-Character-Recognition

Vision AI service ( REST API ) for OCR ( optical character recognition ) 📷

Language: Python - Size: 34.2 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

Artificial-Brix/BRIX-OCR

Hello, this is Brix OCR, an open-source project where we try to build an OCR engine with the help of others as well as make a custom model.

Language: Jupyter Notebook - Size: 47.3 MB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 3

elvinagam/Tesseract-OCR-AZE

Tesseract on a few examples, such as template matching for different languages (AZE)

Language: Jupyter Notebook - Size: 9.65 MB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 2 - Forks: 0

spajai/Ocr-OcrSpace

Perl Interface for Optical Character Recognition https://ocr.space/ (https://metacpan.org/pod/OCR::OcrSpace)

Language: Perl - Size: 14.6 KB - Last synced at: about 1 month ago - Pushed at: almost 4 years ago - Stars: 0 - Forks: 0

batux/staff_identity_card_ocr_project

Staff Identity Card OCR Project

Language: Python - Size: 33 MB - Last synced at: 3 months ago - Pushed at: over 6 years ago - Stars: 16 - Forks: 7

TungstenTransformation/Vidado

How to integrate Vidado Read OCR into Kofax Transformation's Advanced Zone Locator

Language: Visual Basic .NET - Size: 42 KB - Last synced at: over 1 year ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

anselcorona/WekaOCR

Data Mining class capstone project.

Language: Java - Size: 18.9 MB - Last synced at: over 2 years ago - Pushed at: almost 8 years ago - Stars: 0 - Forks: 0

php-ocr/http

Modules and interfaces for HTTP engines.

Language: PHP - Size: 9.77 KB - Last synced at: 12 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

rohitsroch/OCR

A basic OCR pipeline written in TensorFlow for CPU on the EMNIST dataset

Language: Python - Size: 223 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 2 - Forks: 1

PresentIDco/Credit-Card-OCR

PresentID Optical character recognition API can extract data from all types of cards including driver's license, National ID card, Certificate, etc.

Size: 6.84 KB - Last synced at: 15 days ago - Pushed at: almost 4 years ago - Stars: 1 - Forks: 2

bq-xiao/ocr-demo

OCR Demo

Language: Python - Size: 134 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

simranjeet97/GoogleVision_OCR

GoogleVision_OCR Project to Read out PDF

Language: Python - Size: 16.6 KB - Last synced at: 4 months ago - Pushed at: over 4 years ago - Stars: 0 - Forks: 0

farbod-s/Text-Recognition

Undergraduate Final Project

Size: 1.61 MB - Last synced at: 4 months ago - Pushed at: over 5 years ago - Stars: 6 - Forks: 0

Cool-fire/Snipps

📚 📝📜 A simple Android app that converts information into digital snippets and allows extracting text using OCR.

Language: Java - Size: 3.35 MB - Last synced at: over 2 years ago - Pushed at: over 5 years ago - Stars: 4 - Forks: 0

itsayushthada/PGMs

Repository for tasks like Representation, Inference and Learning of Probabilistic Graphical Models.

Language: MATLAB - Size: 4.74 MB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 0 - Forks: 0

tsudnevitz/HardyBits.Ocr

.NET OCR Engine based on Tesseract 4

Language: C# - Size: 68.6 MB - Last synced at: 7 months ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 0

Techget/Text-Recognition

Language: Python - Size: 58.9 MB - Last synced at: over 2 years ago - Pushed at: about 7 years ago - Stars: 0 - Forks: 0

nepalihackers/nepali-ocr

Optical Character Recognition framework and application for Nepali Language

Size: 4.88 KB - Last synced at: over 2 years ago - Pushed at: about 8 years ago - Stars: 0 - Forks: 0

TheOpenDevProject/ocradjs-browser

OCRAD.js wrapped up neatly for building into your browser-based projects for Webpack

Language: JavaScript - Size: 620 KB - Last synced at: 4 months ago - Pushed at: almost 9 years ago - Stars: 0 - Forks: 0

sgino209/OCR_NaturalPhotos

OCR of Natural Photos using Machine-Learning techniques (based on Python sklearn toolkit)

Language: Python - Size: 15.6 KB - Last synced at: over 2 years ago - Pushed at: over 8 years ago - Stars: 0 - Forks: 0